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ARRAVSOFNUCIEICACIDP^^^^^ ^aatoTl^tttS^^^^^ 

BIOLOGICAL CHIPS ^.^^^ h^ivfcca a target and reference sequence, including 

^oncc DPTTPPPMPF TO RELATED single or multiple base substitutions, and deletions and 

CROSS-REreRENCE^^ insertions .of bases, as well as detecting the presence, 

APPLICATION 5 ^^^^^.^^ sequence of other more complex variations 

Hiis is a Continuation of application Ser. No. 08/143,312, between a target and reference sequence in a nucleic acid, 

filed Oct 26 1993 now abandoned, which is a continuation jhe present invention provides arrays of ^gonucieoiiae 

S naS of U S patent application Ser. No. 082,937, filed 25 probes immobUized on a solid support. The arrays are 

1993 now b^^ incorporated herein by refer- preferably synthesized directly on /he support us ng 
jun. lyyj, now du , lo ^LSjpgTw technology, but other synthesis methods and 

. . c A A' ^.rf h« immobUization of pre-synthesized oUgonucleotide probes 

Research leading to the invention was funded in part by ^^^^^ oligonucleotide probe arrays, called 

NIH grant No. 1R01HG00813.01 and DOE grant No. S^^^^Xs" of the invention. In general, these arrays 

DE.FG03-92-ER81275, and the government may have cer- ^^^^^^^ ^ set of oligonucleotide probes such that, for each 
Uin rights to the invention. is ^^se in a specific reference sequence, the set includes a 

^^v. nrnhe rcalled the "wild-tvpe" or "WT' probe) that is exactly 

BACKGROUND OF THE INVENTION Jction of the reference sequence 

1 Field of the Invention including the base of interest and four additional probes 

probes immobilized in microfabncated patterns on sdic 20 ^^^'J^^^^^ 

chips for analyzing molecular interactions of biolo^cal ^^f^^J^^^^din^.^t, one of the four substitution probes is 

interest. The invention therefore relates to diverse neias r^^^j-^^j ^^^^ ^ype probe; the other three are comple- 

impacted by the nanire of molecular interaction, mcluding ^ ^^^^ ^^^^ ^ single-base substitution at this 

chemistry, biology, medicine, and medical diagnostics. position. 

2. Description of Related Art jn another aspect, the invention relates to the arrangement 

Olieonucleotide probes have long been used to detect of individual probes in the array. In one embodiment, the 

complementary nucleic acid sequences in a nucleic acid of probes are arranged on the chip so that Pfo^^^sjor a given 

btSthe 'larger nucleic acid). In some assay fomiats, position in the sequence are adjacent, and Probes fo^^^^^ 

Sl Xonuci probe is te hercd, i.e., by covalent cent posiUons in the reference sequence are also adjacent to 

ttUrn?^^^^^ of oligonucleotide one another on the chip. One method arranges the probes for 

detect specific nudeic add sequen^^^^ orXritS 5^^^^ 

acid. See, e.g.. PCT patent publication Nos. WO 89 10^^^^ fnd each of the substitution probes have specified positions 

and 89/11548. Others have proposed the use of large num andeacn ^ ^^^^ corresponding to an 

bers of oligonudeotide probes to provide the complete "^'^^^^^^^^^^ ^^.^pj,^ ^ ^ single row. Tlie stripes 

nudeic acid sequence of a target nucleic but taiieo lo separated on the chip by a blank row or column, 

provide an enabling method for using arrays of "^mobilized ^^^^ invention can be made in a wide 

probes for this purpose. See U S. Pat. Nos. 5 202,231 and ^^^^^^ of variations. For some applications, leaving out the 

5,002,867 and PCT patent publication No. WO 93/17126. ^-^^^^y^^ row, leaving out unimportant bases, pooling bases, 

The development of VLSIPS^" technology has provided including insertion and deletion probes, varying the length 

methods for making very large arrays of oligonudeotide jjj^ ^^q^^^ within a set to make the probes have the same 

probes in very small arrays. See U.S. Pat. No. 5,143,854 and or similar Tm relative to the target or to avoid secondary 

PCT patent publication Nos. WO 90/15070 and 92/10092, structure, varying the mutation position, usmg multiple 

each of which is incorporated herein by reference. U.S. 45 p^o^es for a single mutation, providing replicate probes or 

patent application Ser. No. 082.937, filed Jun. 25. 1993, ^^^ys, placing blank "streets" (no probe) between rows, 

describes methods for making arrays of oligonudeotide columns, or individual probes, and using control probes may 

probes that can be used to provide the complete sequence of appropriate. 

a target nucleic acid and to detect the presence of a nucleic present invention also provides DNA chips for detect- 

acid containing a specific nucleotide sequence. 50 jng mutations assodated with cystic fibrosis, including 

Microfabricated arrays of large numbers of oligonude- mutations in exons 4. 7, 9, 10, 11, 20, and 21 of the CFTR 

otide probes, called "DNA chips" offer great promise for a ge^e jhe invention also provides DNA chips for detecting 

wide variety of applications. New methods and reagents are mutations in the p53 gene, a gene in which mutations are 

required to realize this promise, and the present invention j^Qwn to be assodated with a wide vanety of cancers^Other 

helps meet that need. 55 DNA chips of the invention provide probe arrays for detect- 
ing specific sequences of mitochondrial DNA, useful tor 

SUMMARY OF THE INVENTION identmcation and forensic purposes. The invention also 

Iht present invention provides methods for making high- provides DNA chips for detecting spedfic sequences of 

density arrays of oligonucleotide probes on siUca chips and nucleotides or mutations associated with the acquisition ot a 

for usmg those probe arrays to detect specific nucleic acid 60 drug resistant phenotype in an infectious organism, such as 

sequences contained in a target nucldc acid in a sample. The rifampicin or other drug resistant TB strams and HIV, in 

invention also provides arrays of oligonucleotide probes on which mutations in an RNA polymerase gene are known to 

DNAchips, in which the probes have spedfic sequences and give rise to drug resistance, 

locations in the array to facilitate identification of a specific ^^^^^ DESCRIPTION OF THE DRAWINGS 

target nudeic add. In another aspect, the invention provides 65 ^^^^^.^^ 

methods for detecting whether one or more specific Fia ^J^^^^^^ ^J^^^ , , , „^eleic acid, 

sequences of a target nudeic acid in a sample varies from a defines a set of DNA prooes reiaiivc g 
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,n .he agu.. .he .arge. DN^ .o.ecu.e. .be probes .e " rnt^inT^efB^^^^^^^ 

SZ]|.e:U'(wi. .0 .be AP508 

no. 2 shows illustrative .Ued array of .he mven.io.. mutaUon) m m u ^ panels A aad 

wi.hprobesforthede.ec.ionofpom.mu.a.,on^^^^^^ ' B S FlJ TsJows gr^s orfluoLenJ m^^^^^^^ 

, Oic posi.ion of subs.ini..0D in each of .be wild-type probes « °' f^^e^ hotizon.al axis show .he 

is shown in the wUd-lype lane, and the shadmg shows the f^a^^yp, corresponding to .he posi- 

loca.ion of .he substitution probe having Ihc wUd-type ^^ "^i.u.ion to .he respective probes. Plo..ed are the 

sequence. The SEQ ID. NOS. ^'^^^''^du.g to th two ^""^^^g'^^^f^'^^fa^^^^ (or synthesis sites) 

peptidesequencesshownmthetopportionoEHG.2are3n ^"^"^^^ ™ ^es, the features containing the 

and 312. respectively. The SEQ ID. NOS. c^^^^^^ ^bSn5«L.'bound .hemostta,getraU^ 

the five peptide sequences hsted at the bottom of FIG. 2 are uDsu p 

313, 314, 315, 313. and 316. respectively. tarEet with*e»«'°dhi^^^^ 

HG. 3, in paneU A, B, and C, shows an image made bom Highesr). The SEQ ID NOS. corresponding to 

the region of a DNAchip containing CFTR exon 10 probes; ^ . ' eotide sequences shown in sheet 2 of HG. 8 are 332 

in panel A, the chip was hybridized to a wild-type target; m respectively 

panel C, the chip was hybridized to a ■»';»°»^f 508 large.; ■ ^^^^^ MoniM genome; "0„" is 

and in panel B, .he chip hybri^-tEO ^ r^ the H s.rand origin of replica.ion, and arrows indica.e .he 

wild-lvoe and mutant targets. The bbU nj. inu:>. corrc u.^„/c««»^n^ 
Sofi to the four peptide sequences shown in FIG. 3 are ^0 cloned unshaded sequence. 

7^0 ^.nLS FIG. 10 shows the image observed from apphca ion of a 

317-320, respective y. mitochondrial DNA derived nucleic acid (from 

FIG 4 in sheets 1-3, corresponding to panels A, U, ana sduipi^ 

C 0 FIG 3 shows graphs of fluorescence intensity versus the mt4 sample) on a DNAchip 

Sing po^^^^^^^^ f£ on the horizontal axis sLw the FIG. 11 is similar to FIG. 10 but shows the m.age 

bases in the wild-type sequence corresponding to the posi- observed from the mt5 sample 

tion of substitution in the respective probes. Plotted are the FIG. 12 shows the predicted difference image between the 

intensities observed from the features (or synthesis sites) mt4 and mt5 samples on the DNAchip based on mismatches 

containing wild-type probes, the features containing the between the two samples and the reference sequence, 
substitution probes that bound the most target ("called"), and 3^ ^IG. 13 shows the actual difference image observed for 

the feature containing the substitution probes that bound the ^nd mt5 samples. 

target with the second highest intensity of all the substitution ^ greets 1 and 2, shows a plot of normalized 

probes ("2nd Highest"). The SEQ ID. NOS. corresponding intensities across rows 10 and 11 of the array and a tabula- 

to the two peptide sequences shown in sheet 1 of FIG. 4 are ^^^^ mutations detected. 

321 and 318, respectively; the SEQ ID. NOS. corresponding 3^ ^^^^^ discrimination between wild-type and 
to the two peptide sequences shown in sheet 2 of FIG. 4 are ^^^^^^ hybrids obtained with the chip. A median of the six 

322 and 318, respectively; and the SEQ ID. NOS. corre- ^^^^^^^^-cd hybridization scores for each probe was taken; 
sponding to the two peptide sequences shown in sheet 3 ot ^^^^ ^^^^ ^^^j^ j^j^ j^j^q 5^0^^ to the normal- 
FIG. 4 are 323 and 318, respectively. hybridization score versus mean counts. A ratio of 1.6 

FIG. 5, in panels A, B, and C, shows an image made from 40 ^^^^^ ^^^^^ a^ove 50 yield no false positives, 
a region of a DNA chip containing CFTR exon 10 probes; illustrates how the identity of the base mismatch 
in panel A, the chip was hybridized to the wt480 target; in influence the ability to discriminate mutant and wild- 
panel C, the chip was hybridized to the mu480 target; and in , ^ sequences more than the position of the mismatch 
panel B, the chip was hybridized to a mixture of the ^.^^^^ oUgonucleotide probe. The mismatch position is 
wild-type and mutant targets. The SEQ ID. NOS. corre- 45 ^ ^^^j ^ % p^be length from the 3'-end. The base 
sponding to the peptide sequences shown in FIG. 5 are ^^^^^^ indicated on the graph. 
324-327, respectively. yj provides a 5* to 3' sequence Usting of one target 

RG. 6, in sheets 1-3. corresponding to panels A, B, and corresponding to the probes on the chip. X is a control probe. 

C of FIG. 5, shows graphs of fluorescence intensity versus p^^ijio^s ^^at differ in the target (i.e., are mismatched with 

tiling position. The labels on the horizontal axis show the 50 ^^^^ ^^^^ designated site) are in bold. The SEQ ID. 

bases in the wUd-type sequence correspondmg to the posi- corresponding to the peptide sequence shown in FIG. 

tion of substitution in the respective probes. Plotted are the ^ 333 

intensities observed from the features (or synthesis sites) ^^^^^ fluorescence image produced by scan- 
containing wild-type probes, the features ^otaming the ^ ^^^^ hybridized to a 
substitution probes that bound the most target ("called' ), and 55 "^"S V 

the feature containing the substitution probes to • ^^^^^^^^ ^^e 

'^^S'^S;^^^^ .0 %';G:20showsthealignmentofsomeoftheprobeson^^ 

M^vSl sequences shown in sheet 2 of HG. 6 are p" DNA chip with a ^-er -odeUa^g^^^ ^^7^^^^^ 

330 and 329, respectively; and the SEQ ID. NOS. corre- SEQ ID. NOS. correspondm^^^ ll^^^"^^^^^ 

st)ondiDe to the two peptide sequences shown in sheet 3 of sequences shown in FIG. 20 are 334-347 respectiveiy. 

™ 6 are 3^^^^^ respectively. FIG. 21 shows a set of lO-mer probes for a p53 exon 6 

region of a DNA chip containing CFTR exon 10 probes; in pepUde sequences shown in FIG. 21 are 334 and 348-359, 

panel A, the chip was hybridized to nucleic acid denved respectively. 
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FlG. 22 shows that very distinct patters are observed ^ ^h^^^^^ WhVw 

after hybridization of p53 DNA chips with targets having ^^^S™^^^ as the number of nucleotides 

different 1 base substitutions. In the first image in FIG. 22, me p i^.stranded nucleic acid probe. For pur- 

the 12.mer probes that form perfect matches with the the presem invention, lengths ranging from 12 to 18 

wild-type target are in the first row (top). The 12-mcr probes 5 preferred, although shorter and longer lengths can 

with single base mismatches are located in the second, third, employed. To employ the tiling method, one syn- 

and fourth rows and have much lower signals. ^^^^^.^^ ^ ^ ^ ^ ^ ^^^^^^ ^^^^^^ particular nucleotide 

FIG. 23, in graphs 2, 3. and 4, graphicaUy depicts the data • j^jterest in the target nucleic acid. For each base 

in HG. 22. On each graph, the X ordinate is the position of ^^^^^^ DNAsegment, one synthesizes a probe comple- 

Ihe probe in its row on the chip, and the Y ordmate is the ^^g^iary to the subsequence of the target nucleic acid begin- 

signal at that probe site after hybridization. ^^^^ ^ase and ending L-1 bases to the 3'-side (see 

FIG. 24 shows the results of hybridizing mixed target 

populations of WT and mutant p53 genes to the p53 DNA ^ preferred embodiment of the invention, the probes are 
chip. , 15 arranged (either by immobilization, typically by covalent 

FIG 25, in graphs 1^, shows (see FIG. 23 as weU) the attachment, of a pre-synthesized probe or by synthesis of the 

hybridization efficiency of a 10-mer probe array as com- ^^^^^ substrate) on the substrate or chips m lanes 

pared to a 12-mer probe array stretching across the chip and separated, and these lanes are 

FIG 26 shows an image of a p53 DNA chip hybridized to j^^^^d arranged in blocks of preferably 5 lanes, although 
a target DNA 20 blocks of other sizes wUl have useful application, as will be 

HG 27 illustrates how the actual sequence was read from apparent from the following illustration. The tirst ot mese 

the chip shown in HG. 26. Gaps in the sequence of letters lanes, called the "wild-type lane", contams probes 

in the WT rows correspond to control probes or sites. arranged in order of sequence, and all ot the prooes are 

Positions at which bases are miscaUed are represented by complementary to a specified wild-type nucleic acia 

letters in iulic type in cells corresponding to probes in which ^5 sequence. The other four lanes contain probe sets tor detect- 

theWTbaseshavebeensubstitutedbyotherbases-TheSEQ ^^g all possible single-base mutations m the delined 

ID NO corresponding to the peptide sequence shown in sequence; in turn, these probe sets are defined by a position 

FIG 27 is 360 of potential non-complementarity in the probe relative to the 

FIG 28 Ulustrates the VLSIPS™ technology as appUed to target (i.e., a single base mismatch) and the identity of the 

the Ughtdirectedsynthesisof oligonucleotides. Ught(hv)is 3, nucleotide in the probe at that P°^^^^^^^^^^^^ 

shone through a mask (M,) to activate ftmctional groups nucleotide ^ an A, C, G, or T nucleotide) Tlie posiUon ot 

STn f surface by removal of a protecting group (X). mismatch, also called the position of substitution, is^ ^er- 

Nucleoside buUding blocks protected with photoremovable ably selected to be near the center of the probes, i.e., position 

protecting groups (T-X, C-X) are coupled to the activated 7 of a probe of L=l 5. , 

areas By repeating the irradiation and coupling steps, very 35 por each probe in the wild-type lane, one synthesizes tour 

complex arrays of oligonucleotides can be prepared. probes (one for each of the lanes other than the wild-type 

m 29 illustrates how the VLSIPS™ process can be used lane), Three of these four probes is identical to the corre- 

to prepare "nucleoside combinatorials" or oHgonucleotides spending wOd-typc probe but for the base at the position ot 

synthesized bv coupUng all four nucleosides to form dimers, substitution, and the remaimng probe is identical to ine 

U^LsTt^ 40 wild-type probe. Tliis set of four substitution probes is 

HG.'bO shows the deprotection, coupling, and oxidation preferably placed in a ^^^^^J^,^^^^^^^^ 

steps of a solid phase DNAsynthesis method. corresponding ^^l^-^f^f^^vfo^^^^^^^ 

HG, 31 shows an illustrative synthesis route for the C-lane, a G- ane, and ^/'l^^^; ™ f 0^^^^^ 

nudeoside buUding blocks used in the VI^IPS- method. ^iled arra^^^^^ ^ 

FIG. 32 shows a preferred photoremovable protecting 45 P-"JJ~,JJ^ 
group, MeNPOC. and how to prepare the group m active 

form. - . J . , havine the wild-type sequence. Below are the probes that 

HG. 33 illustrates an illustrative detection system for "^'pi^^j h the column marked by the arrow if the 

scanning a DNA chip. 50 probe length were 15 and the position of substitution were 

DETAILED DESCRIPTION OF THE 7, 

INVENTION 3'-CCGACTGCAGTCGTr (SEQ. ID. N0:1) 

Using the VLSIPS™ method, one can synthesize arrays 3;-CCGACTAC^^^ jD. N0:2) 

of many thousandsofoligonucleotideprobesonasubstrate, ^'CCGACTCC^^^^ D. NO.S 

such as a glass slide or chip. The method can be used, for 55 ^ -CCGACTCCAG^^^ D. NU.l) 

instance, to synthesize "combinatorial" arrays consistmg of, ^^CCGACTTCAGTO 

for example, all possible octanucleotides. Such arrays can be Thus, the substitu ion lanes occupy fo^r of*^^^ ^^^^ 
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probes to detect a particular mutation or other specific "^^^^^^^ 'X^""^ 
Sequence of nucleotides. In some circumstanc^ ^^^^^ ^',^^\'Z, 1^^^ oi sub-nanomolar 
nucleic acids with repeated sequences or with high G/C ^ AF508 oligonucleotide tar- 
content, very long probes may be required for optimal ^^^^^.^ ^^.^^ ^^^^^^ fluorescein to these arrays 
detection. In one embodiment for detecting speciDc 5 ^^^j^^^^^ ^^^^y specific signals (detected with confocal 
sequences in a target nucleic acid with a DNA chip, repeat ^^^j^g fluorescence microscopy) that permit discrimina- 
sequences are detected as follows. The chip comprises ^.^^ between mutant and wild-type target sequences in both 
probes of length sufficient to extend into the repeat region homozygous and heterozygous cases. The method and chips 
varying distances from each end. The sample, prior to invention can also be used to detect other known 
hybridization, is treated with a labeled oligonucleotide that lO j^j^tations in the CFTR gene, as described in detaU below, 
is complementary to a repeat region but shorter than the full ^^^^ common cystic fibrosis mutation is known as 
length of the repeat. The target nucleic is labeled with a ^p5og^ because the mutation is a three-base deletionthat 
second distinct label. After hybridization, the chip is results in the removal of amino acid #508 from the CFTR 
scanned for probes that have bound both the labeled target protein. The present invention provides DNA chips for 
and the labeled oligonucleotide probe; the presence of such 15 detecting AF508. one such chip resuhs from applymg the 

hnund orobes shows that at least two repeat sequences are tiUng method to exon 10 of the CFTR gene, tne exon lo 
bound prodes snows y ^ ^^^^ ^^^^ ^^^^^ ^^.^^ ^^^^^^ ^^^^^^^ 

aucleicacidscontammgsequencescompn.mgrunsof3or 30 J^^'^^^^}^^'^ ^^l^^,^ ^hip. The A-lane 

sirs „ r^:!':^'^'^7=:^r.r^ 

Son i iltor ted by fhe detective of mutations in a similar fashion, substituuon lanes w..h replacement ba« 

S of cuSy and medicaUy significant human nucleic dC, dG. and dT were placed onto the chip m a C-lime 

Idd Luences T^r^^^^^ illustrated first with «G-lane," and a "T-lane," respecUvely A «xth lane on the 

re"lXtte prepa«^ DNAchips for the detection of chip consisted of probes identical to those m the wJd-type 

Sed :^"h cystic flb^sis. then with DNA « lane but for the deletion °f 

chips for the detection of human mitochondrial DNA restoration of the onpnal probe length by ad*t^on ^ 

S»ces to with DNA chips for the detection of muu- S'-end the base complementary to the gene a that P^tion^ 

mutations. The present mvention provides CFTR gene oli- 55 exon .^.^^^ ^^3^ „_ ^^^^ p^^bes 10, 14, 

been used to develop a number of useful CFTR gene chips. nucleic acids were made. Ttie finst, a ^^'^^f^^^ 
one illustrative chip bears an array of 1296 probes covering to a subsequence "^/.^J^ 

ESSES » ^^B^^ 

residues and can be used to detect and sequence any single- below: 
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5..cArrAAAGAAAAWCATcrrrGGTGrrrccTAr- ^^^SS:^:S:S?£SZ 

target by removing those same three bases, is caUed the comsponas » However, the probe in the T-lane of 
"muUnt-targetormuSOStargetandhasthesequenceshown 5 f"""^''^/^^;^ very^eU. 
below. fi«t with dashes to indicate the deleted ^a^^' -^ ^ExTSirn of he sequences ofYhe two targets reveals 
then without dashes but with one base underlinedXto indi- ^ ^ at that position when the 

cate the base detected by the T-lane probe, as dmissed J^j' ,heir 3'-ends and that the T-lane 

.»^*AAATATrAT---lo probe is complementary to the mutant target with but two 
5'-CATTAAAGAAAATA T C A 1 lo ?,T°° ,^u„ „ear an end (shown below in lower-case letters, 

TGGTGTITCCr^G/^GM^^^^ ^h of s«^ underUned): 

5'-CAITAAAGAAAArArCArTGGTGnTCCTArGAlUA. ""j. P5,.catTAAAGAAAATATCATTGGTGT- 

(SEQ.IDN0:7) . ^ . ttCCTATGATGA 

Both targets were labeled with fluorescem J^f p^^^^^^^ TAGTAACCACAA (SEQ. ID N0:8) 

In three separate experiments, the wJd-type a^e . Ae 15 Probe 3 l^^g ^ ^^^^ ^^^^^ ^^^^ 

mutant Urget. and an equimolar mixUire of both targeUwas 1^^ me P ^^^^^^ 
exposed (0.1 nM wt508, 0.1 nM mu508, and 0.1 nM w608 bas« ^oU mixture of the two targets, that T-lane probe 
plus 0.1 nM mu508, respect vdy, m » ^-^on compat*k f^^l™ "^^^^ target as does the A-lane probe in the 
U nucleic acid hybridization) to a CF <=h!P jl'^ hyb"d^ Jrn^ ^ J^. where^as in the other column sets, the 
ization mixture was incubated overnight at room 20 ^amyoiu . ^ 
temperature, and then the chip was scanned on reader (a P-j!* ''° °^^u ^hus. that one column set, and in 
confocal fluorescence microscope « Ph°'°"-~g r he Wane probe vJithin that set, detects the AF508 

imagesofthechipwereconstructedfromthephotoncount^^ Son under conditions that simulate the homozygous 
at several successively higher •^■"Pf '° „ ~a^l^tionsthatsimulatethehe.erozygouscase. 

contact with the target solution. After each temperature 25 c"^'"^ *^ ^<,^aes individual probes, 

change, the chip was allowed to equilibrate for appro«- The present in y 
matelyone-halfhourbeforebeingscanned.Aftereachsetof P^'^'^Vprolis pr^^^^^ 

scans: the chip was exposed to denaturing solvent and pattern. ^ j;""^/ PJ^^^j^^ ^ sequences. THe 

conditions to wash, i.e., remove target that had bound the ^ f/^f^j^^f j^ortant probes of the invention are 
chip so that the next experiment coM be done with a clean 30 « ^^^^ ^J^^ , ..x" stands for the point 

ehip- . , . Eirt i l A of substitution in a given column set, so each of the 

. mresultsoftheexperimenlsareshownmF GS.3,4,5 

and 6. FIG. 3, in panels A. B and C. shows « m'.de ^'l"/""™ J^, "X." Sets of shorter 

from the region of a DNA chip containing CFTR exon 10 T^^fP^'^^^^' ^ ,hown below by removing up 

probes; in panel A. the chip was hybridized to a wild-type 35 P't^ ^^'^^^.'s'.^^dot each probe and sets of longer 
urget; in panel C. the chip was hybridiz«=d to a mu ant delta '° "/^"^^^^^^ by adding up to three bases from 

508 targeu and in panel B. the chip was bybndized to a f f^^^^^^' Js'-cnd of each probe, are also 
mixtureof thewild-typeandmut^ttarge«.HG.4,.n heeu .^e invention. 

1-3, corresponding to panels A, B. and Cof FIG. 3, shows ^AGAAACC (SEQ. ID N0:9) 

graphs of fluorescence intensity versus tding position- IHe 40 ^ "E^^AGAAA V 
labelsonthehorizontalaxisshowtheba»smthewdd.type ^-TT/JAGXAGAAACCAg « 
sequence corresponding to the position of substitution in the ^ -WAGTXGAAA^^ ; 
respective probes. Plotted are the intensiUes observed from ^ -J^^^J^^cACAA (SEQ. ID N0:13) 
thefeatures(orsynthesissites)containingwild-typeprob», ^-TAGTAG^C^^^ > 
thefeaturescontainingthesubstitulionprobesthatboundthe 45 3;AGTA™cC^AC^ J W ^^ J^ 
most target ("called"), and the feature contaimng the sub- ^-GTAGA^CCACAAAO J U ^^^^ 
stitutionprobesthatboundtheta.getwitbUies«»ndh^^^^^ I AG^iJScAAAGGA (SEQ. ID N0:17) 
intensity of all the substitution probes ("and Highest ). 3 "AGAAA^^ALA/^ ^ ^^^^ ^ 

Tbc^ figures show that, for the wUd-type target and the ^ ^ js of the target, where there is 

equimolar mixture of targete, the substitution probe v.^^^ 50 '^^^^^^'^^^^^ target and probe to allow 
nucleotide sequence identical to the,correspoi^ding mid- P^^g^^^^ Ja,e center of the target, 

type probe bound the most target, aUowing for an jmam- f'^^'^^^^^^ ^,31, for some other reason, per- 
biguous assignment of target sequence as shwn by letters P^^^^^^ results show the method and the 

near the points on the curve THe Urget w^08 thus hyW- ^^'g.S^SJ^ be used to detect the mutation of 
ized to the probes in the wfld-type lane of the chip, although 55 °I mv«. hybridization 

theslrengthofthehybridizationvariedfromprobe-to-prob^^ ^TwsS^SrtXt of the ^08 target at the ends, 
probably due to differences in melting J'* Jo Ze a common sequence, and very different 

sequence of most of the target can thus be '^f^'f^i^^^^^l She middk where the deletion is located. As one scans the 
the chip, by inference from the pattern of hybridizauon m m the . . . j hybridization of the 

thelanLf'ubs.itutionprobes(ifthe.argethybn^^^^^^^ 

intensely to the probe in the A-lane then one -fers that the '"g«« ^« P „f ■ fo, „„508 than for 

urget has a T in the position of substitution and so on). "WjitTon there is one probe in the T-lane that 

For the muunt target, the sequence could sunJarly be w^M m „,u508 and hardly at all with 

caUed on the 3'-side of the deletion. However, the intensity ^"f '"he equimolar mixture of the two 

ofbindingdeclined precipitously as the point ofsubstih^^^^^^^ 65 w^08. encounter in 

approached the site of the deletion frotn the 3 -end of the argels. "h^^^ ^ P , „„,,tion, are a 

urget, so that the binding intensity on the wild-lype probe testing a heterozygous 
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blend of the results for the separate targets, showing the 
power of the invention to distinguish a wild-type target 
sequence from one containing the AF508 mutation and to 
detect a mixture of the two sequences. 

The results above clearly demonstrate how the DNA chips 5 
of the invention can be used to detect a deletion mutation, 
AF508; another model system was used to show that the 
chips can also be used to detect a point mutation as well. One 
of the more frequent mutations in the CFTR gene is G480C, 
which involves the replacement of the G in position 46 of lO 
exon 10 by a T, resulting in the substitution of a cysteine for 
the glycine normaUy in position #480 of the CFTR protein. 
The model target sequences included the 21-mer probe 
wt480 to represent the wild-type sequence at positions 
37-55 of exon 10: 5'-CCTrCAGAGGGTAAAArTAAG 15 
(SEQ ID NG:18) and the 21-mer probe mu480 to represent 
the mutant sequence: S'-CCTTCAGAGTGTAAAArTAAG 
(SEQ. ID N0:19). 



terns. The wild-type sequence could easily be read from the 
chip, but the probe that bound the mu480 target so weU when 
only the mu480 target was present also bound it well when 
both the mutant and wild-type targets were present in a 
mixture, making the hybridization pattern easily distinguish- 
able from that of the wild-type target alone. These results 
again show the power of the DNA chips of the invention to 
detect point mutations in both homo- and heterozygous 
individuals. 

To demonstrate clinical application of the DNA chips of 
the invention, the chips were used to study and detect 
mutations in nucleic acids from genomic samples. Genomic 
samples from a individual carrying only the wild-type gene 
and an individual heterozygous for AF508 were ampUfied by 
PGR using exon 10 primers containing the promoter for T7 
RNA polymerase. Illustrative primers of the invention are 
shown below. 



Exon Name Sequence 



(SEQ. ID. NO:20) 



10 CFi9-T7 TMTACGAOXIACrArAGGGAGatgacctaataatgatggsttt . 

10 CFilOcT? TAATACGACrCACTATAGGGAGtaglgtgaagggttcaUtgc SEQ. m. Na21 

10 Cril0c-T3 crcGGAATrAA(XCrcACrAAA(K>lagtgtgaa6ggt^^^^^ SEQ. Na22 

10 11 CFilO-T7 TAATACGACrCACrATAGGGAGagcatactaaaagtgactctc SEQ. ^. Na23 
ll' CFillc-r7 TAArACGACrCACrATAGGGAGacatgaatgacatttacagcaa SEQ. N0.24 

11 crtllc-T3 CGGAATTAACCCrCACTAAAGGacatgaatgacatttacagcaa (SEQ. LP. N0.25) 



In separate experiments, a DNA chip was hybridized to 
each of the targets wt480 and mu480, respectively, and then 
scanned with a confocal microscope. FIG. 5, in panels A, B, 
and C, shows an image made from the region of a DNA chip 
containing CFTR exon 10 probes; in panel A, the chip was 
hybridized to the wt480 target; in panel C, the chip was 
hybridized to the mu480 target; and m panel B, the chip was 
hybridized to a mixture of the wild-type and mutant targets. 
FIG. 6, in sheets 1-3, corresponding to panels A, B, and C 
of FIG. 5, shows graphs of fluorescence intensity versus 
tiling position. The labels on the horizontal axis show the 
bases in the wild-type sequence corresponding to the posi- 
tion of substitution in the respective probes. Plotted are the 
intensities observed from the features (or synthesis sites) 
containing wild-type probes, the features containing the 
substitution probes that bound the most target ("called"), and 
the feature containing the substitution probes that bound the 
target with the second highest intensity of all the substitution 
probes ("2nd Highest"). 

These figures show that the chip could be used to 
sequence a 16-base stretch from the center of the target 
wt480 and that discrimination against mismatches is quite 
good throughout the sequenced region. When the DNA chip 
was exposed to the target mu480, only one probe in the 
portion of the chip shown bound the target well: the probe 
in the set of probes devoted to identifying the base at 
position 46 in exon 10 and that has an A in the position of 
substitution and so is fully complementary to the central 
portion of the mutant target. All other probes in that region 
of the chip have at least one mismatch with the mutant target 
and therefore bind much less of it. In spite of that fact, the 
sequence of mu480 for several positions to both sides of the 
mutation can be read from the chip, albeit with much- 
reduced intensities from those observed with the wild-type 
target. 

The results also show that, when the two targets were 
mixed together and exposed to the chip, the hybridization 
pattern observed was a combination of the other two pat- 



30 These primers can be used to amplify exon 10 or exon 11 
sequences; in another embodiment, multiplex PGR is 
employed, using two or more pairs of primers to amplify 
more than one exon at a time. 
The product of amplification was then used as a template 
35 for the RNA polymerase, with fluoresceinated OTP present 
to label the RNA product. After sufficient RNA was made, 
it was fragmented and applied to an exon 10 DNA chip for 
15 minutes, after which the chip was washed with hybrid- 
ization buffer and scanned with the fluorescence micro- 
40 scope. A useful positive control included on many CF exon 
10 chips is the 8-mer 3'.CGCCGCCG.5'. HG. 7, in panels 
A and B, shows an image made from a region of a DNA chip 
containing CFTR exon 10 probes; in panel A, the chip was 
hybridized to nucleic acid derived from the genomic DNA of 
45 an individual with wild-type AF508 sequences; in panel B, 
the target nucleic acid originated from a heterozygous (with 
respect to the AF508 mutation) individual. FIG. 8, in sheets 
1 and 2, corresponding to panels A and B of FIG. 7, shows 
graphs of fluorescence intensity versus tiling position. 
50 These figures show that the sequence of the wild-type 
RNA can be called for most of the bases near the mutation. 
In the case of the AF508 heterozygous carrier, one particular 
probe, the same one that distinguished so clearly between 
the wild-type and mutant oligonucleotide targets m the 
55 model system described above, in the T-lane binds a large 
amount of RNA, while the same probe binds little RNA from 
the wild-type individual. These results show that the DNA 
chips of the invention are capable of detecting the AF508 
mutation in a heterozygous carrier. 
60 Thus, the present invention provides methods for synthe- 
sizing large numbers of oligonucleotide probes on a glass 
substrate and unique probe sets in a defined array in which 
the probes are arranged in the array by the "tiling" method 
of the invention. The DNA chips produced by the method 
65 can be used to detect mutations in particular sequences of a 
target nucleic acid, such as genomic DNA or RNAproduced 
from transcription of an amplified genomic DNA. These 
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chips can be used to detect both point mutations and small 
deletions. Moreover, the pattern of hybridization to the chip 
allows inferences to be drawn about the sequences of the 
mutant DNAs. 

For example, in the model system involving the cystic 5 
fibrosis point mutation G480C, the A-lane probe whose 
position of substitution corresponds to the position of the 
mutation does not bind much wild-type target, because in the 
wild-type sequence, a G occupies that position. However, it 
binds mutant target very well, allowing one to infer correctly 10 
that the mutation involves a change of that G to a T. 
Similarly, in the case of the three-base deletion in cystic 
fibrosis known as AF508, the T-lane probe that binds mutant 
target so intensely is responding to the fact that the deletion 
has brought a CAT sequence into the position occupied by 15 
a err sequence in the wild-type target. The DNA chips of 
the invention can be used to detect and sequence not only 
known mutations in an organism's genome but also new 
mutations not previously characterized. The DNA chips and 



some applications to using a minimal set of oligonucleotides 
specific to the sequence of interest, rather than a set of all 
possible N-mers. Some of these advantages include: (i) each 
position in the array is highly informative, whether or not 
hybridization occurs; (ii) nonspecific hybridization is mini- 
mized; (iii) it is straightforward to correlate hybridization 
differences with sequence differences, particularly with ref- 
erence to the hybridization pattern of a known standard; and 
(iv) the ability to address each probe independently during 
synthesis, using high resolution photolithography, allows the 
array to be designed and optimized for any sequence. For 
example the length of any probe can be varied independently 
of the others. 

The present invention illustrates these advantages by 
providing DNA chips and analytical methods for detecting 
specific sequences of human mitochondrial DNA. In one 
preferred embodiment, the invention provides a DNA chip 
for analyzing sequences contained in a 1.3 kb firagment of 



methods of the invention can also be used to detect specific 20 human mitochondrial DNA from the "D-loop" region, the 



sequences in other CFTR exons as well as other human 
genes for purposes of research and clinical genetic analysis, 
as demonstrated below. 

Detection of Specific Human Mitochondrial DNA 
Sequences with DNA Chips 

As noted above, the present invention provides DNA 
chips on which a known DNA sequence is represented as an 
array of overlapping oligonucleotides on a solid support. 
This set of oligonucleotides is used to probe a target nucleic 



most polymorphic region of human mitochondrial DNA. 
One such chip comprises a set of 269 overlapping oligo- 
nucleotide probes of varying length in the range of 9-^14 
nucleotides with varying overlaps arranged in -600x600 
25 micron features or synthesis sites in an array 1 cmxl cm in 
size. The probes on the chip are shown in columnar form 
below. An illustrative mitochondrial DNA chip of the inven- 
tion comprises the following probes (X, Y coordinates are 



shown, followed by the sequence; "DL3" represents the 
acid comprising the known sequence, allowing mutations to 30 3'-end of the probe, which is covalently attached to the. chip 
be detected. As also noted above, there are advantages in surface.) 
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9 2 

DUAGTXKjGGTAnr (SEQ ID. NO:26) 10 2 

DUGGGTATITAGTr (SEQ ID. NO:2'0 H 2 

DUTTAGnTATCCAA (SEQ ID. N0:2S) 12 2 

DUATCCAAACXAGG (SEQ ID. NO:29) 13 2 

DUACCAGGATCGGA (SEQ ID. NO 30) 14 2 

DL3CGTGTGTGTGTGG (SEQ ID. N0:31) 15 2 

DL3CGTGTGTGTGTGGC (SEQ ID. NO J2) 16 2 

DL3TCGTGTGTGTGTGG (SEQ ID. NO:33) 0 3 

DL3GTAGGATGGGTC (SEQ ID. NO:34) 1 3 

DLSAGGATGGGTCGT (SEQ ID. N0-J5) 2 3 

DLSGATGGGTOGTGT (SEQ ID. NO:36) 3 3 

DLSTGGCGACGATTG (SEQ ID. NO J7) 4 3 

DL3GCGACGAITGGG (SEQ ID. NO:38) 5 3 

DL3TtjGGGGGGA 6 3 

DL3GAGGGGGCG 7 3 

DL3GGAGGGGGCGA (SEQ ID. N0^}9) 8 3 

DL3GAGGGGGCGA (SEQ ID. NO:40) 9 3 

DLSGGOTGGTrcG (SEQ ID. NO:41) 10 3 

DLSGGTTGGTITGGG (SEQ ID. NO:42) 11 3 

DLSTGGGGTrrCTAG (SEQ ID. NO:43) 12 3 

DLSGTITCTAGTGGG (SEQ ID. NO:44) 13 3 

DL3AGTGGGGGGTGT (SEQ ID. NO:45) 14 3 

DUGGGGTGTCAAAT (SEQ ID. NO:46) 15 3 

DUGTCAAATACATCG (SEQ ID. NO:47) 16 3 

DUACATCGAATGGAG (SEQ ID. NO:48) 0 4 

DUCGAATGGAGGAG (SEQ ID. NO:49) 1 4 

DL3GAGGAG'nTCGT (SEQ ID. NO:50) 2 4 

DUTrrCGTTArGTGA (SEQ ID. NO:51) 3 4 

DUATGTGACmTAC (SEQ ID. NO:52) 4 4 

DUGACmTACAAAT (SEQ ID. NO:53) 5 4 

DUAAATCTGCCCGA (SEQ ID. NO:54) 6 4 

DUAATCTGCCrGAG (SEQ ID. NO:55) 7 4 

DL3CXXGAGTGTAGT (SEQ ID. NO:56) 8 4 

DL3AGTCTAGTGGGG (SEQ ID. NO:57) 9 4 

DUGGGAGGGTGAG (SEQ ID. NO:58) 10 4 

DUGGTGAGGGTATG (SEQ ID, NO:59) 11 4 

DUGGTATGATGATTAG (SEQ ID. NO:60) 12 4 

DL3GAITAGAGTAAGT (SEQ ID. N0:61) 13 4 

DUTTAGAGTAAGITA (SEQ ID. NO:62) 14 4 



DL3GGTAGGArGGGT 

DL3GGATGGGTCGTG 

DUGGTCGTGTGTGT 

DL3GTGTGTGTGGCG 

DUTGTGGCGACGAT 

DL3GACGAITGGGGT 

DUATTGGGGTArGG 

DL3GTArGGGGCnXj 

DUGGATTGTGGTCG 

DL3TGGTCGGAITGG 

DL3GGAITGGTCTAAA 

DUTCTAAAGTITAAA 

DUGTITAAAATAGAA 

DUATAGAAAAACCG 

DL3AGAAAAACCGC 

DL3AAC0GCCATAC 

DUCCATACGTGAAAA 

DUACGTGAAAATTGT 

DUAATTGTCAGTGGG 

DL3TGTCAGTGGGGG 

DIJTGGGGTTGA 

DUGGGTTGAnXjTGT 

DUITGTGTAArAAAA 

DUAATAAAAGGGGA 

DUTAAAAGGGGAGG 

DL3GTmTTAAAGG 

DL3TnTAAAGGTGG 

DUAGGTGGTrrcO 

DUriTGGGGGGGAG 

DL3GGAGGGGGCG 

DL3GGGGCGAAGAC 

DL3GAAGACCGGATG 

DL3CCGGATGTCGTG 

DUGTCGTGAAnTGT 

DUCGTGAATrTGTCT 

DUTTGIGTAGAGACG 

DUTAGAGACGGnr 

DL3ACGGnTGGGG 

DUTGGGGTmTGT 

DL3GGGTmTGTTT 
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(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
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(SEQ ID. 
(SEQ ID. 
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NO:67) 

NO:68) 

NO:69) 

NO:70) 

N0:71) 

N0:72) 

NO:73) • 
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NO:75) 

NO:76) 
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NO:78) 

NO:79) 

NO:S0) 

N0:81) 

NO:82) 
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N0:S4) 

NO:85) 

NO:8j5) 
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NO:88) . 

N0:S9) 

NO:90) 

N0:91) 
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(SEQ ID. NO:215) 
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DLiCCTArGGTIOnT 


(SEQ ID. NO:216) 
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DUGGTrGnrGGATO 


(SEQ ID. NO:217) 


13 


14 
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DUGTITGGArGGGT 


(SEQ ID. NO:218) 
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14 
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DUATGGGTGGGAAr 


(SEQ ID. NO:219) 
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DL3GGGAAITGTCArG 


(SEQ ID. KO:220) 


16 


14 
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DL3GTCArGTArCATGT 


(SEQ ID. NO:221) 
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15 
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DUTCATGTAnTCGG 


(SEQ ID. NO:222) 
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15 
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11 


DL3TAnTCGGTAAA 


(SEQ ID. NO:223) 
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15 


13 


11 


DUTrcGGTAAATGG 


(SEQ ID. NO:224) 
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DUGTAAATGGCATGT 


(SEQ ID. NO:225) 
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15 
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DUGG^TAATCGTG 


(SEQ ID. NO:226) 


10 


15 
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DUGTAATCGTGTAAT 


(SEQ ID. NO:227) 


11 


15 
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DUGGGAGGGGTAC 


(SEQ ID. NO:228) 


12 


15 
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12 


DL3GGGTACGAATGT 


(SEQ ID. NO:229) 


13 


15 
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DUACGAATGTTCGTT 


(SEQ ID. NO:230) 


14 


15 
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12 


DLSTGTrcGTrCATGT 


(SEQ ID. NO:231) 


15 


15 
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12 


DUCGTTCATGTCGTT 


(SEQ ID. NO:232) 


16 


15 



DUrTGTITCITGGG 

DurcnxKKjArrGTC 

DUrGTATGAATGAnT 

DUTGAnrCACACAA 

DUCTCTGCGACCrC 

DUGACCrCGGCCT 

DUrCGGOCrCGTG 

DLJGATGAAGTCCCAG 

DUAGTCCCAGTAnr 

DUGTAnrCXKiAnT 

DUTCGGAnTATCG 

DUGAnTATCGGGT 

DUATCOGGTGTGCA 

DL3TGTGCAAGGGGA 

DUCAAGGGGAAnr 

DUGAAnTAITCTGTA 

DUTCrcTAGTGCrAC 

DUGTAGTGCTACCr 

DL3GCTACCTAGTAG 

DUCTAGTAGTCXIAGA 

DUrCCAGArASTGGG 

DUAGATAGTGGGATA 

DUGGGATAAITGOT 

DUTAAITGGTGAGTG 

DUTATAGGGCGTGT 

DUGGGCGTGTTCTCA 

DUGTGTTCTCACGAr 

DL3rrcACGArGAGAGG 

DUATGAGAGGAGCG 

DUAGGAGCGAGGC 

DL3CGAGGCCCGG 

DI^JGCCCGGGTATT 

DUCGGGTATTGTGA 

DUGTGAACCCCCAT 

DUCCCCATCGATTT 

DUATCGAnrcAcrr 

DL3mTCACITGACAr 

DL3TrcACArAGAGCr 

DUTAGAGCTGTAGAC 

DL3GTAGACCAAGGA 

DUACCAAGGATGAAG 

DL3CGTGTAArGTCAG 

DUTCTCAGnTAGGG 

DUTCAGTITAGGGA 

DL3TAGGGAAGAGCA 

DUAAGAGCAGGGGT 

DL3CAGGGGTACCTA 

DUGGTACCTACTOG 

DUTACrOGGGGGA 

DUGGGGG AGTCT AT 

DL3CArGTAnnTGG 

DUTnTGGGTTAGG 

DUGGGTTAG GArGT 

DL3GGArGTAGmTG 

DL3TGTAGTnTGGG 

DUTITGGGGGAGG 

DL3GGGTrcArAACrG 

DUATAACTGAGTGGG 

DUAACTGAGTGGGT 

DUGTGGGTAGTTGT . 

DUGTAGTIGTrGGC 

DL3GTIXjG0GArACA 

DL3CGATACATAAAAG 

DUrAAAAGCATGTAA 

DL3GCArGTAArGACG 

DUATOACGGTCGGT 

DUGTCGGTOGTACr 

DUGGTACTTATAACA 

DL3TCGAITCTAAGAr 

DUTAAGAJTAAAnr 

DLSAAAnTGAATAAG 

DUAATAAGAGACAAG 

DUAAGAGACAAGAAA 

DUAAGAAAGTACCC 

DUAAAGTAOCCCTT 

DUCCCCrrCGTCTA 

DUCTTCOTCTAAAC 

DUCrAAACCCATGG 

DUAACCCATGGTGG 

DLJrGGTGGGTrCAr 
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DUGTCGirAGnXK) 


(SEQ ID. NO:233) 
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16 


DUTTGGAAAAAGGT 


(SEQ ID. NO:276) 
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12 


DUTAGTrcOGAGTT 


(SEQ ID. NO:234) 
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16 


DL3AAAAGGTrCCTG 


(SEQ ID. NQ:277) 


12 


12 


DUGGAGTTGATAGTG 


(SEQ ID. NO:235) 
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16 


DUGOTTCCTGTTTA 


(SEQ ID. NO:278) 


13 


12 


DUATAGTGTGTAGTT 


(SEQ ID. NO:236) 
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16 


DLJCCTGnTAGTCrC 


(SEQ ID. NO:279) 


14 
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DUGTFIAGITGACGT 


(SEQ ID. NO:237) 
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16 


DUTTAGTCTCrill'i' 


(SEQ ID. NO:2«0) 


15 


12 


DUTGACGTTGAGGT 


(SEQ ID. NO:238) 


10 


16 


DL3CnillCAGAAAr 


(SEQ ID. NO:281) 


16 


12 


DUCG1TGAGGTITA 


(SEQ ID. NO:239) 


11 


16 


DUAGAAATTGAGGTG 


(SEQ ID. NO:282) 
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13 


DUTATAACATGCCAT 


(SEQ ID. NO:240) 


12 


16 


DUAAATl'GAGGTGGT 


(SEQ ID. NO;2S3) 
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13 


DLSAACATGCCATGGT 


(SEQ ID. NO:241) 


13 


16 


DL3GOTGGTAArCGT 


(SEQ ID. NO:284) 
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13 


DLSCCATGGTAITAT 


(SEQ ID, KO:242) 


14 
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DL3TAArCGTGGGTr 


(SEQ ID. NO:285) 
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13 


DLSAnTATGAACTGG 


(SEQ ID. NO:243) 


15 
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DDGTGGGnTCGAr 


(SEQ ID. NO;286) 
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DLSAACTGGTGGACAT 


(SEQ ID. NO:244) 


16 


16 


DUGGnrCGATTCT 


(SEQ ID. NO:287) 


10 


13 


DL3TGGACATCATGTA 


(SEQ ID. NO:245) 











No probes were present in positions X, Y=0, 12 to X, Y«4, 
12; X, Y=0, 13 to X, Y«4, 13; X, Y»0, 14 to X, Y=4. 14; X, 
Y-0, 15 to X, Y-4, 15; X. Y-0, 16 to X, Y-4, 16; The length 
of each of the probes on the' chip was variable to minimize 
differences in melting temperature and potential for cross- 
hybridization. Each position in the sequence is represented 
by at least one probe and most positions are represented by 
2 or more probes. As noted above, the amount of overlap 
between the oligonucleotides varies from probe to probe. 
FIG. 9 shows the human mitochondrial genome; "0/," is the 
H strand origin of replication, and arrows indicate the cloned 
unshaded sequence. 

DNA was prepared from hair roots of six human donors 
(mtl to mt6) and then amplified by PCR and cloned into 
M13; the resulting clones were sequenced using chain 
terminators to verify that the desired specific sequences were 
present. DNA from the sequenced M13 clones was amplified 
by PCR, transcribed in vitro, and labeled with fluorescein- 
UTP using T3 RNA polymerase. The 1.3 kb RNA transcripts 
were fragmented and hybridized to the chip. The results 
showed that each different individual had DNA that pro- 
duced a unique hybridization fingerprint on the chip and that 
the differences in the observed patterns could be correlated 
with differences in the cloned genomic DNA sequence. The 
results also demonstrated that very long sequences of a 
target nucleic acid can be represented comprehensively as a 
specific set of overlapping oligonucleotides and that arrays 
of such probe sets can be usefully applied to genetic analy- 
sis. 

The sample nucleic acid was hybridized to the chip in a 
solution composed of 6xSSPE, 0.1% Triton-X 100 for 60 
minutes at 15° C. The chip was then scanned by confocal 
scanning fluorescence microscopy. The individual features 
on the chip were 588x588 microns, but the lower left 5x5 
square features in the array did not contain probes. To 
quantitate the data, pixel counts were measured within each 
synthesis site. Pixels represent 50x50 microns. The fluores- 
cence intensity for each feature was scaled to a mean 
determined from 27 bright features. After scanning, the chip 
was stripped and rehybridized; all six samples were hybrid- 
ized to the same chip. FIG. 10 shows the image observed 
from the mt4 sample on the DNA chip. FIG. 11 shows the 
image observed from the mt5 sample on the DNA chip. FIG. 
12 shows the predicted difference image between the mt4 
and mt5 samples on the DNA chip based on mismatches 
between the two samples and the reference sequence (see 
Anderson ct al., 1981, Nature 290: 457-465, incorporated 
herein by reference). FIG. 13 shows the actual difference 
image observed. 

The results show that, in almost all cases, mismatched 
probe/target hybrids resulted in lower fluorescence intensity 
than perfectly matched hybrids. Nonetheless, some probes 
detected mutations (or specific sequences) better than others. 



and in several cases, the differences were within noise levels. 
Improvements can be realized by increasing the amount of 
overlap between probes and hence overall probe density 
and, for duplex DNA targets, using a second set of probes, 
either on the same or a separate chip, corresponding to the 
second strand of the target. FIG. 14, in sheets 1 and 2, shows 
a plot of normalized intensities across rows 10 and 11 of the 
array and a tabulation of the mutations detected. 

FIG. 15 shows the discrimination between wild- type and 
mutant hybrids obtained with this chip. The median of the 

25 six normalized hybridization scores for each probe was 
taken. The graph plots the ratio of the median score to the 
normalized hybridization score versus mean counts. On this 
graph, a ratio of 1.6 and mean counts above 50 yield no false 
positives, and while it is clear that detection of some mutants 

30 can be improved, excellent discrimination is achieved, con- 
sidering the small size of the array. FIG. 16 illustrates how 
the identity of the base mismatch may influence the ability 
to discriminate mutant and wild-type sequences more than 
the position of the mismatch within an oligonucleotide 

2^ probe. The mismatch position is expressed as % of probe 
length from the 3 -end. The base change is indicated on the 
graph. These results show that the DNA chip increases the 
capacity of the standard reverse dot blot format by orders of 
magnitude, extending the power of that approach many fold 
and that the methods of the invention are more efiScient and 

^ easier to automate than gel-based methods of nucleic acid 
sequence and mutation analysis. 

These advantages become more apparent as chips with 
more and more probes are employed. To illustrate, the 
present invention provides a DNA chip for analyzing human 

45 mitochondrial DNA (mtDNA) that "tiles" through 648 
nucleotides of human H strand mtDNA from positions 
16280 to 356. The probes in the array are 15 nucleotides in 
length, and each position in the target sequence is repre- 
sented by a set of 4 probes (A, C, G, T substitutions), which 

50 differed from one another at position 7 from the 3'-end. The 
array consists of 13 blocks of 4x50 probes: each block scans 
through 50 nucleotides of contiguous mtDNA sequence;- The 
blocks are separated by blank rows. The 4 corner columns 
contain control probes; there are a total of 2600 probes in a 
1.28 cmxl.28 cm square area (feature), and each area is 
256x197 microns. 

Labeled RNA target DNA was prepared by PCR ampli- 
fication of a 1.3 kb region of human mtDNA spanning 
positions 15935 to 667, cloning into M13 (sequence verifi- 
cation was performed), and reampUfication of the cloned 
sequences using primers tagged with T3 and T7 RNA 
polymerase promoter sequences and in vitro transcription to 
produce fluorescein-UTP labeled RNA. The RNA was frag- 
mented and hybridized to the oligonucleotide array in a 
solution composed of 6xSSPE, 0.1% Triton X-100 for 60 

65 minutes at 18* C. Unhybridized material was washed away 
with buffer, and the chip was scanned at 25 micron pixel 
resolution. 
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FIG. 17 provides a 5' to 3' sequence listing of one target between the particular mutation in p53 and the functioning 

corresponding to the probes on the chip. X is a control probe. of the resulting protein. Furthermore, there are projects 

Positions that differ m the target (i.e., are mismatched with looking at the germline inheritance of p53 mutations and the 

the probe at the designated site) are in bold. FIG. 18 shows development of cancer. The present invention provides 

, the fluorescence image produced by scanning the chip when 5 useful DNA chips and methods for such studies, 

hybridized to this sample. About 95% of the sequence could In addition, the present invention also provides a diag- 

be read correctly from only one strand of the original duplex nostic test kit and method and p53 probes immobilized on a 

target nucleic acid. Although some probes did not provide DNA chip in an organized array. Currently available diag- 

excellent discrimination and some probes did not appear to nostic tests for cancer typically have a sensitivity of about 

hybridize to the target efficiently, excellent results were lO 50%. The present invention provides significant advantages 

achieved. The target sequence differed from the probe set at over such tests, and in one embodiment provides a method 

six positions: 4 transitions and 2 insertions. All 4 transitions for detecting cancer-causing mutations in p53 that involves 

were detected, and specific probes could readily be incor- the steps of (1) obtaining a biopsy, which is optionally 

porated into the array to detect insertions or deletions. FIG. fractionated by cryostat sectioning to enrich tumor cells to 

19 illustrates the detection of 4 transitions in the target 15 about 80% of the total cell population. The DNA or RNA is 

sequence relative to the wild-type probes on the chip. then extracted, amplified, and analyzed with a DNA chip for 

These results illustrate that longer sequences can be read the presence of p53 mutations correlated with malignancy, 

using the DNA chips and methods of the invention, as To illustrate the value of the DNA chips of the present 

compared to conventional sequencing methods, where read- invention in such a method, a DNA chip was synthesized by 

ing length is limited by the resolution of gel electrophoresis. 20 the VLSIPS™ method to provide an array of overlapping 

Similar results were observed when genomic DNA samples probes which represent or tile across .a 60 base region of 

were prepared from human hair roots. Hybridization and exon 6 of the p53 gene. To demonstrate the ability to detect 

signal detection require less than an hour and can be readily substitution mutations in the target, twelve different single 

shortened by appropriate choice of buffers, temperatures, substitution mutations (wild type and three different substi- 

probes, and reagents. In principle, longer sequence reads can 25 tutions at each of three positions) were represented on the 

be obtained than by conventional sequencing, where reading chip along with the wild type. Each of these mutations was 

length is limited by the resolution of gel electrophoresis. represented by a series of twelve 12-mer oligonucleotide 

P53 Sequencing and Diagnostic DNA Chips probes, which were complementary to the wild type target 

P53 is a tumor suppressor gene that has been found to be except at the one substituted base. Each of the twelve probes 

mutated in most forms of cancer (see Levine et al, 1991, 30 was complementary to a different region of the target and 

Nature 351: 453-456, and Hollstein et al, 1991, Science contained the mutated base at a different position, e.g., if the 

253: ,49-53, each of which is incorporated herein by . substitution was at base 32, the set of probes would be 

reference). In addition, there is a hereditary syndrome, complementary-with the exception of base 32 — to regions 

Li-Fraumeni, in which individuals inherit mutant alleles of of the target 21-32, 22-33, and 32-43). This enabled inves- 

p53 and tend to have cancer at relatively young ages 35 ligation of the effect of the substitution position within the 

(Frebourg et al., 1992, PNAS 89: 6413-6417, incorporated probe. The alignment of some of the probes with a 12-mer 

herein by reference). During the development of a cancer, model target nucleic acid is shown in FIG. 20. 

p53 is inactivated. The course of p53 inactivation generally To demonstrate the effect of probe length, an additional 

involves a mutation in one copy of p53 and is often followed series of ten 10-mer probes was included for each mutation 

by deletion of the other copy. After p53 is inactivated, 40 (seeFIG, 21). In the vicinity of the substituted positions, the 

chromosomal abnormalities begin to appear in tumors. In wild-type sequence was represented by every possible over- 

the best understood form of cancer, colorectal cancer, well lapping 12-mer and 10-mer probe. To simplify comparisons, 

over 50%, perhaps 80%, of all patients with tumors have p53 the probes corresponding to each varied position were 

mutations. In addition, p53 mutations have been found in a arranged on the chip in the rectangular regions with the 

high proportion of lung, breast, and other tumors (Rodrigues 45 following structure: each row of cells represents one 

et al., 1990, PNAS 87: 7555-7559, incorporated herein by substitution, with the top row representing the wild type, 

reference). According to data presented by David Sidransky Each column contains probes complementary to the same 

(1992 San Diego Conference), over 400 mutations in p53 are region of the target, with probes complementary to the 

known. 3'-end of the target on the left and probes complementary to 

The p53 gene spans 20 kbp in humans and has 11 exons, 50 the 5'-end of the target on the right. Thie difference between 

10 of which are protein coding (see Tominaga et al, 1992, two adjacent columns is a single base shift in the positioning 

Critical Reviews in Oncogenesis 3: 257-282, incorporated of the probes. Whenever possible, the series of 10-mer 

herein by reference). The gene produces a 53 kilodalton probes were placed in four rows immediately underneath 

phosphoprotein that regulates DNA replication. The protein and aligned with the 4 rows of 12-mer probes for the same 

acts to halt replication at the.Gl/S boundary in the cell cycle 55 mutation. 

and is believed to act as a "molecular policeman," shutting To provide model targets, 5' fluoresceinated 12-mers 

down replication when the DNA is damaged or blocking the containing all possible substitutions in the first position of 

reproduction of DNA viruses (see Lane, 1992, Nature 35S: codon 192 were synthesized (see the starred position in the 

15-16, incorporated herein by reference). There is substan- target in FIG. 20). Solutions containing 10 nM target DNA 

tial interest in the cancer research community in analyzing 60 in 6xSSPE, 0.25% Triton X-100 were hybridized to the chip 

p53 mutations. The NCI is currently funding contracts to at room temperature for several hours. While target nucleic 

characterize the p53 mutation spectra caused by various was hybridized to the chip, the fluorophores on the chip were 

carcinogens. In addition, there are research projects which excited by light from an argon laser, and the chip was 

involve sequencing p53 from spontaneously arising tumors. scanned with an autofocusing confocal microscope. The 

A major resource in these studies is the huge supply of 65 emitted signals were processed by a PC to produce an image 

biopsy material stored in paraflSn blocks. Also, there are using image analysis software. By 1 to 3 hours, the signal 

projects which are aimed at analyzing the relationship had reached a plateau; to remove the hybridized target and 
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allow hybridization to another target, the chip was stripped 
with 60% formamide, 2xSSPE at 17" C. for 5 minutes. The 
washing buffer and temperature can vary, but the buffer 
typically contains 2-to-3xSSPE, lO-to-60% formamide (one 
can use multiple washes, increasing the formamide concen- 
tration by 10% each wash, and scanning between washes to 
determine when the wash is complete), and optionally a. 
small percentage of Triton X-100, and the temperature is 
typically in the range of 15* to 18* C. 

Very distinct patterns were observed after hybridization 
with targets with 1 base substitutions and visualization with 
a confocal microscope and software analysis, as shown in 
FIG. 22. In general, the probes which form perfect matches 
with the target retain the highest signal. For example, in the 
first image in Figure PC, the 12-mer probes that form perfect 
matches with the wild-type (WT) target are in the first row 
(top). The 12-mer probes with single base mismatches are 
located in the second, third, and fourth rows and have much 
lower signals. The data is also depicted graphically in FIG. 
23! On each graph, the X ordinate is the position of the probe 
in its row on the chip, and the Y ordinate is the signal at that 
probe site after hybridization. 

When a target with a different one base substitution is 
hybridized the complementary set of probes has the highest 
signal (see pictures 2, 3, and 4 in FIG. 22 and graphs 2, 3, 
and 4 in FIG. 23). In each case, the probe set with no 
mismatches with the target has the highest signals. Within a 
12-mer probe set, the signal was highest at position 6 or 7. 
The graphs show that the signal difference between 12-mer 
probes at the same X ordinate tended to be greatest at 
positions 5 and 8 when the target and the complementary 
probes formed 10 base pairs and 11 base pairs, respectively. 
Because tumors often have both WT and mutant p53 genes, 
mixed target populations were also hybridized to the chip, as 
shown in FIG. 24. When the hybridization solution consisted 
of a 1:1 mixture of WT 12-mer and a 12-mer with a 
substitution in position 7 of the target, the sets of probes that 
were perfectly matched to both targets showed higher sig- 
nals than the other probe sets. 

The hybridization efficiency of a 10-mer probe array as 
compared to a 12-mer probe array was also compared. The 
10-mer and 12-mer probe arrays gave comparable signals 
(see graphs 1-4 in FIG. 23 and graphs l-A in FIG. 25). 
However, the 10-mer probe sets, which are in rows 5-8 (see 
images in FIG. 22), seemed to be better in this model system 
than the 12-mer probe sets at resolving one target from 
another, consistent with the expectation that one base mis- 
matches are more destabilizing for 10-mers than 12-mers, 
Hybridization results within probe sets perfectly matched to 
target also followed the expectation that, the more matches 
the individual probe formed with the target, the higher the 
signal. However, duplexes with two 3' dangles (see FIG. 23, 
position 6 in graphs 1-4) have about as much signal as the 
probes which are matched along their entire length (see FIG. 
23, position 7, in graphs 1-4), 

This illustrative model system shows that 12-mer targets 
that differ by one base substitutions can be readily distin- 
guished from one another by the novel probe array provided 
by the invention and that resolution of the different 12-mer 
targets was somewhat better with the 10-mer probe sets than 
with the 12-mer probe sets. The value of having several 
overlapping probes hybridizing to a target demonstrates the 
value of the multiple hybridization events that take place on 
a DNA chip of the invention. The results also demonstrate 
the feasibility of constructing a probe set to sequence the 
entire 1.4 kbp protein coding region of p53 or alternatively 
the 0.6 kbp of exons 5-9 containing mutation hot spots. 
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For sequencing, the p53 DNA can be cloned from the 
sample or directly amplified from genomic DNA by PGR. If 
genomic PGR is used, then the DNA can be diluted prior to 
amplification so that a single copy of the gene is amplified. 
5 For diagnostic purposes, the genomic DNA can be isolated 
from a tumor biopsy in which the tumor cells may be the 
majority population. As noted above, the proportion of 
tumor cells in a sample can be enriched by cryostat section- 
ing, DNA can also be isolated and amplified from tumor 
10 samples stored in parafSn blocks. 

The p53 DNA in the sample can be amplified by PGR 
(although other amplification methods can be used) using 
3-4 primer pairs generating amplicons of <3 kbp each. 
Illustrative primers of the invention for amplifying exon 5 of 
15 the p53 gene are shown below (B is biotin; F is fluorescein). 
5'-B-CACTTGTGCCCTGACTTTCAAC-3'(SEQ. ID 

NO:288) 

5'-F-CACTrGTGCCCTGACTTTCAAC-3* 
S'-ATGCAATTAACCCTCACTAAAGGGAGACACTTG- 
20 TGCCCTGACTrrCAAC-3'(SEQ. ID NO:289) (has T3 
promoter) 

5'-B-GACCCTGGGCAACCAGCCCTGTCGT.3'(SEQ. ID 
NO:290) 

5*-F-GACCCTGGGCAACCAGCCCTGTCGT-3' 
25 S'-TAATACGACTCACTATAGGGAGGACCCTGGGCA- 
ACCAGCCCTGTCGT-3'(SEQ. ID NO:291) (has T3 
promoter) 

After PGR amplification of the target (the amplified target is 
called the "amplicon") one strand of the amplicon can then 

30 be isolated, i.e., using a biotinylated primer that allows 
capture of the undesired strand on streptavidin beads. 
Alternatively, asymmetric PGR can be used to generate a 
single-stranded target. Another approach involves the gen- 
eration of single stranded RNA form the PGR product by 

35 incorporating a T7 or other RNA polymerase promoter in 
one of the primers. The single-stranded material can option- 
ally be fragmented to generate smaller nucleic adds with 
less significant secondary struchire than longer nucleic 
acids. 

40 In one such method, fragmentation is combined with 
labeling. To illustrate, degenerate 8-mers or other degenerate 
short oligonucleotides are hybridized to the single-stranded 
target material In the next step, a DNA polymerase is added 
with the four different dideoxynucleotides, each labeled with 

45 a different fluorophore. Fluorophore-labeled dideoxynucle- 
otide are available from a variety of commercial suppliers, 
such as ABI. Hybridized 8-mers are extended by a labeled 
dideoxynucleotide. After an optional purification step, i.e., 
with a size exclusion column, the labeled 9-mers are hybrid- 

50 ized to the chip. Other methods of target fragmentation can 
be employed. The single-stranded DNA can be fragmented 
by partial degradation with a DNAse or partial depurination 
with acid. Labeling can be accomplished in a separate step, 
i.e., fluorophore-labeled nucleotides are incorporated before 

55 the fragmentation step or a DNA binding fluorophore, such 
as ethidium homodimer, is attached to the target after 
fragmentation. 

In one embodiment, the DNA chip has an array of 10* to 
10^ probes tiling across the protein coding regions of p53, 

60 which comprise about 1200 bp; smaller arrays specific for 
the 600 bp mutational hot spot region are also useful. The 
probes overlap for N-2 to N-4 bases, where N is the length 
of the probe in bases. N is typically 10 to'l4 bases long, but 
as will be seen below, probes 15 to 19 bases and longer are 

65 also useful. Every possible single base substitution occur- 
ring one at a time is represented in the array. The number of 
unique 10-mer probes with 7 base overlaps would be about 
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(1200/3)x4xl0 or about 1.6x10^ To aUow 3 replicates of of DNA. First, the target DNA is amplified by PGR with 

each probe, one might have a total array size on the order of primers allowing easy ligation into a vector, which is taken 

4.8x10'* probes. Of course, arrays of probes within the up by transformation of E. coli which in turn must be 

ranges of 10^ to 10* probes are also useful for applications; cultured, typicaUy on plates overnight. After growth of the 

for example, very large arrays of 10' or more probes are 5 bacteria, DNA is purified in a procedure that typicaUy takes 

useful for sequencing or sequence checking large genomic ut 2 hours; then, the scquencmg reactions are performed, 

DNAfragments. Optionally fragmented and labeled target which takes at least another hour and the sampl^ a^^ 

t • -J I. L • J- J * *u L- • J * * J u f I the eel for several hours, the duration dependmg on the 

nucleic acid hybnd.zed to the ch.p .s detected by a confocal f ^ ^ ^ P 

microscope or other imagmg device. The pattern of sites invention provides direct Lalysis of ^ePCRampU- 

"bghtmg up- with Urget IS preferably analyzed with com- lo ^^^^^^j ^^^^ ^^^^ transcription and fragmentation 

puter assistance to prowde the sequence of the target from ^^^^ ^^^^ ,5^^,^ j^^or. 

the pattern of sites producing signals. , , An interesting clinical application for the characterization 

The invention is illustrated below with examples of DNA heterozygous mutations with DNA chips is as foUows. 

chips comprismg very large arrays of DNA prob« to rese- individuals with germUne cancer mutations have a very high 

quenoe p53 target nucleic acid m a sample. To analyze 15 ^ ^^^^^ ^^^^ j^^^^^^^, ^ i„,diation. 

DNA from exon 5 of the p53 tumor suppressor gene, a set ^„ ^^^^^^ .^^^ ^^^^ ^„ta- 

of overlapping H-mer probes was syn&esized on a chip. 53 ^^^^ suppressor genes. TTius, before 

The probes for the WT allele were synthesized so as to tile ^ ^^^^^^^ ^^^^ ^ ^^^^^ ^„,d use the 

across the entire exon with smgle base overlaps between j^^^^,;^^ ,,3, , 

probes. For each WT probe, a sets of 4 additional probes, 20 suppressor gene mutation, 

one for each possible base subsUtution at posiUon 7, were g^,;^^^, -^erapeutic Management 

??'"*T?fx,! r/?!^T^ The present invention also provides DNA chips that can 

Exon 5 DNA was amplified by PCR with pnniers flanking ^ . ^ysicians to determine optimum therapeutic 

the exon. One of the primers was labeled with fluorescem; ^ y j^,^^;^^ biologically mediated 

themherprunerwaslabeledwithbiotm.AfteramphflcaUon, 25 Resistance to a therapeutic agent in a variety of disease states, 

the biotmylated strand was removed by bindmg to strepta- ^^^g^ of such DNA chips are many, as the chips will 

yidm beads. Tlie fluoresceinaled strand was used in hybrid- physicians recognize health care cost savings, achieve 

"^^P"" ,, , , rapid therapeutic benefits, limit administration of ineffective 

A "^^.^^ ^ «prf «n» r " ,n (d« to 'h<= ^^^^) y^' '""i^ dmgs, monitor changes in 

washybridizedovermghtin 5><SSPE at 60 C lo the probe 30 ^ ^^^^ ^^^^^ ,^ ^ ^^^^^ 

chip (under a cover shp). After washmg with 6xSSPE the applications include the treatment of 

chipwasscaDnedusmgmalmicroscopy HU.i(.sno^^^ HIV. other infectious diseases, and cancer. 

an unage 0 the pS3 chip hybndiz^d to the Urget DN/l hIV has infected a large and expanding number of people. 

Analysis of the mtensuy data showed that 93.5% of to 184 ^^^^ ^^^^^ expenditures. HIV can 

bases of exon 5 were caUed m agreement with fte WT 35 ijiy become resistant to drugs used to treat the infection, 

sequence (see Buchman et a].. 1988, Ge«e 70: 245-252. ^^^.^ ^ heterodimeric protein (51 

mcorporatcd herein by reference). T^e miscalled bases were ^^^^^ transcriptase (RT) encoded by 

from posiuons where probe signal intenaties were tied ^ , ^ > ^ ^. ^ ^^j^ ^^^^ ^^^^^ „f 

^ ; I o'f^' r,r°",;^ probes had the highest signal RTproteinl believed to account for the hypermutability 

mtensity (4^9%). FIG 27 dlustrates how the actual sequence 40 ^^^^ ^^^j . ^^j^ 

was read. Gaps m the sequence of letter in the WT rows commonly used to Ueat HIV infection are converted to 

correspond to wntrol probes or sites Positions at which J^ ^y sequential phosphorylation in the 

bases are miscaUed are represented by letters mitahe type in .^^^^^ ^^^^ incorporation of the 

cells corresponding to probes in which the WT bases have ^J^j^ j^,^ ^j^^, ^^^^ termination of viral 

been substituted by other bases. « replication, because the 5'-.3' phosphodiester linkage can- 

As the diagram indicates, the miscaUed bases are ftom the ^ completed. However, wiihin rfter 6 months to 1 year 

tow intensity areas of the unage. which may be due to j^j^ ^ ^ ^ ^^ 

secondary structtue m the target or prob« preven mg mter- ^^^^ incorporating the analogue and so 
molecular hybndizaUon. To dimmish the effects due to .^sistant to treatment. Several known mutations are shown 

secondary structure, one can employ shorter targets (i.e.. by . 50 j^jji^j^j. j.^^ ^^jjj^ 
target fiagmeniation) or use more stringent hybridization 

conditions. In addition, the use of a set of probes synthesized ' 

by tiling across the other strand of a duplex target can also bt mutahons associated wrrH drug resistance 
provide sequence information buried in secondary structure 

in the other strand. It should be appreciated, however, that 55 anii- 
the pattern of low intensity areas that forms as a result of 
secondary structure in the target itself provides a means to 
identify that a specific target sequence ispresentin a sample. 
Other factors that may contribute to lower signal intensities 

include differences in probe densities and hybridization 60 ^ 
stabilities. 

These results demonstrate the advantages provided by the 
DNA chips of the invention to genetic analysis. As another 
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example, heterozygous mutations are currently sequenced ^^^^^^ ^^^^^^^ ^^^^^ ^^^^^ ^ ^^^^ ^ ^„ 

by an arduous process involving cloning and repurification 65 

of DNA. The cloning step is required, because the gel The present invention provides DNA chips for detecting 

sequencing systems are poor at resolving even a 1:1 mixture the multiple mutations in the HIV RT gene associated with 
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resistance to different therapeutics. These DNA chips will 
enable physicians to monitor mutations over time and to 
change therapeutics if resistance develops. The DNA chip 
will provide redundant confirmation of conserved HIV RT 
and other gene sequences, and the probes on the chip will tile 
through, with overlap, in important mutational hot spot 
regions. The chip will optionally have probes that span the 
entire coding region of the RT and optionally the genes for 
other HIV proteins, such as coat proteins. HIV target nucleic 
acid can be isolated from blood samples (peripheral blood 
lymphocytes or PBMC) and amplified by PCR, primers for 
which are shown in the table below. 
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to gain primary structure information of the DNA target. 
This format has important applications in sequencing by 
hybridization, DNA diagnostics and in elucidating the ther- 
modynamic parameters affecting nucleic acid recognition. 

Conventional DNA sequencing technology is a laborious 
procedure requiring clectrophorctic size separation of 
labeled DNA fragments. An alternative approach, termed , 
Sequencing By Hybridization (SBH), has been proposed 
(Lysov et al., 1988, DokLAkad, NaukSSSR 303: 1508-1511; 
Bains et al., 1988, /. Theon Biol. 135 : 303-307; and 
Dnmanac et al., 1989, Genomics 4: 114-128, incorporated 
herein by reference). This method uses a set of short 



AMPLIFICAnON OF TARGET 

TARGET 

SIZE PRIMER 1 PRIMER 2 



1, 742bp GTAGAArrCTGTrGACTCAGArrGG GATAAGCITGGGCCTrArcTArrCCAr 

(SEQ ID. NO:292) (SEQ ID. NO:294) 

535bp AAArCCATACAATACTCCAGTAnTGC AOCCATCCAAAGGAArGGAGGTIUnTC 

(SEQ ID. NO:293) (SEQ ID. NO:295) 

323bp Genbank#K02013 1889-1908 bases 2211-2192 



The HIV RT gene chips of the invention, as well as the CF, 
mtDNA, and p53 DNA chips of the invention, illustrate the 
diverse application of the methods and probe arrays of the 
invention. The examples that follow describe methods for 
preparing nucleic acid targets from samples for application 
to the DNA chips of the invention and provide additional 
details of the methods of the invention. 

EXAMPLES 
I. VLSIPS"™ Technology 

As noted above, the VLSI PS ^" technology is described in 
a number of patent publications and is preferred for making 
the oligonucleotide arrays of the invention. For 
completeness, a brief description of how this technology can 
be used to make and screen DNA chips is provided in this 
Example and the accompanying Figures. In the VLSIPS 
method, light is shone through a mask to activate functional 
(for oligonucleotides, typically an — OH) groups protected 
with a photoremovablc protecting group on a surface of a 
solid support. After light activation, a nucleoside building 
block, itself protected with a photoremovablc protecting 
group (at the 5'— OH), is coupled to the activated areas of 
the support. The process can be repeated, using different 
masks or mask orientations and building blocks, to prepare 
very dense arrays of many different oligonucleotide probes. 
The process is illustrated in FIG. 28; FIG. 29 illustrates how 
the process can be used to prepare "nucleoside combinato- 
rials" or oligonucleotides synthesized by coupling all four 
nucleosides to form dimers, trimers, etc. 

New methods for the combinatorial chemical synthesis of 
peptide, polycarbamate, and oligonucleotide arrays have 
recently been reported (see Fodor et al., 1991, Science 251: 
767-773; Cho et al., 1993, Science 261: 1303-1305; and 
Southern et al., 1992, Genomics 13: 1008-10017, each of 
which is incorporated herein by reference). These arrays, or 
biological chips (sec Fodor et al, 1993, Nature 364: 
555-556, incorporated herein by reference), harbor specific 
chemical compounds at precise locations in a high-density, 
information rich format, and are a powerful tool for the 
study of biological recognition processes. A particulariy 
exciting application of the array technology is in the field of 
DNA sequence analysis. The hybridization pattern of a DNA 
target to an array of shorter oligonucleotide probes is used 



oligonucleotide probes of defined sequence to search for 
complementary sequences on a longer target strand of DNA. 
The hybridization pattern is used to reconstruct the target 
DNA sequence. It is envisioned that hybridization analysis 
of large numbers of probes can be used to sequence long 
stretches of DNA. In immediate applications of this hybrid- 
ization methodology, a small number of probes can be used 
to interrogate local DNA sequence. 

The strategy of SBH can be illustrated by the following 
example. A 12-mer target DNA sequence, 
35 AGCCTAGCTGAA, (SEQ. ID NO:296) is mixed with a 
complete set of octanucleotide probes. If only perfect 
complementarity is considered, five of the 65^36 octamer' 
probes -TCGGATCG. CGGATCGA, GGATCGAC. 
GATCGACT, and ATCGACIT will hybridize to the target. 
40 Alignment of the overlapping sequences from the hybridiz- 
ing probes reconstructs the complement of the original 
12-mer target: 



TCGGATCG 
CGGATCGA 
GGATCGAC 
GATCGACT 
ATCGACIT 
TOGGATCGACrr (SEQ. ID NO:297) 



. Hybridization methodology can be carried out by . attaching 
target DNA to a surface. The target is interrogated with a set 
of oligonucleotide probes, one at a time (see Strezoska et al., 
1991, Prvc. Natl. Acad. Sci, USA 88: 10089-10093, and 

55 Drmanac el al., 1993, Science 260: 1649-1652, each of 
which is incorporated herein by reference). This approach 
can be implemented with well established methods of immo- 
bilization and hybridization detection, but involves a large 
number of manipulations. For example, to probe a sequence 

60 utilizing a full set of octanuclcotides, tens of thousands, of 
hybridization reactions must be performed. Alternatively, 
SBH can be carried out by attaching probes to a surface in 
an array format where the identity of the probes at each site 
is known. The target DNA is then added to the array of 

65 probes. The hybridization pattern determined in a single 
experiment directly reveals the identity of all complemen- 
tary probes. 
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As noted above, a preferred method of oligonucleotide 
probe array synthesis involves the use of light to direct the 
synthesis of oligonucleotide probes in high-density, minia- 
turized arrays. Photolabile 5'-protected N-acyl- 
deoxynucleoside phosphoramidites, surface linker 
chemistry, and versatile combinatorial synthesis strategics 
have been developed for this technology. Matrices of 
spatially-defined oligonucleotide probes have been 
generated, and the ability to use these arrays to identify 
complementary sequences has been demonstrated by 
hybridizing fluorescent labeled oligonucleotides to the DNA 
chips produced by the methods. The hybridization pattern 
demonstrates a high degree of base specificity and reveals 
the sequence of oligonucleotide targets. 

The basic strategy for light-directed oligonucleotide syn- 
thesis (1) is outlined in FIG. 28. The surface of a solid 
support modified with photolabile protecting groups (X) is 
illuminated through a photolithographic mask, yielding 
reactive hydroxyl groups in the illuminated regions. A 
3'-0-phosphoramidite activated deoxynucleoside (protected 
at the 5'-hydroxyl with a photolabile group) is then presented 
to the surface and coupling occurs at sites that were exposed 
to light. Following capping, and oxidation, the substrate is 
rinsed and the surface illuminated through a second mask, to 
expose additional hydroxyl groups for coupling. A second 
5'-protected, 3'-0-phosphoramidite activated deoxynucleo- 
side is presented to the surface. The selective photodepro- 
tcction and coupling cycles are repeated until the desired set 
of products is obtained. 

light directed chemical synthesis lends itself to highly 
efficient synthesis strategies which will generate a maximum 
number of compounds in a minimum number of chemical 
steps. For example, the complete set of 4n polynucleotides 
(length n), or any subset of this set can be produced in only 
4xn chemical steps. See FIG. 29. The patterns of illumina- 
tion and the order of chemical reactants uhimately define the 
products and their locations. Because photolithography is 
used, the process can be miniaturized to generate high- 
density arrays of oligonucleotide probes. For an example of 
the nomenclature useful for describing such arrays, an array 
containing all possible octanucleotides of dA and dT is 
written as (A+T)®. Expansion of this polynomial reveals the 
identity of all 256 octanucleotide probes from AAAAAAAA 
to TTTTTTTT. A DNA array composed of complete sets of 
dinucleotides is referred to as having a complexity of 2. The 
aaay given by (A+T+C+G)8 is the fiill 65,536 octanucle- 
otide array of complexity four. 

To carry out hybridization of DNA targets to the probe 
arrays, the arrays are mounted in a thermostatically con- 
trolled hybridization chamber. Fluorescein labeled DNA 
targets are injected into the chamber and hybridization is 
allowed to proceed for Vi to 2 hours. The surface of the 
matrix is scanned in an epifiuorescence microscope (Zeiss 
Axioscop 20) equipped with photon counting electronics 
using 50-100 /iW of 488 nm excitation from an Argon ion 
laser (Spectra Physics model 2020). All measurements are 
acquired with the target solution in contact with the probe 
matrix. Photon counts are stored and image files are pre- 
sented after conversion to an eight bit image format. See 
FIG. 33. 

When hybridizing a DNA target to an oligonucleotide 
array, N=Lt-(Lp-l) complementary hybrids arc expected, 
where N is the number of hybrids, Lt is the length of the 
DNA target, and Lp is the length of the oligonucleotide 
probes on the array. For example, for an U-mer hybridized 
to an octanucleotide array, N«4, Hybridizations with mis- 
matches at positions that are 2 to 3 residues from either end 
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of the probes will generate detectable signals. Modifying the 
above expression for N, one arrives at a relationship esti- 
mating the number of detectable hybridizations (Nd) for a 
DNA target of length Lt and an array of complexity C. 

5 Assuming an average of 5 positions giving signals above 
background: Nd=(l+5(C-l))[Lt-(Lp-l)]. 

Arrays of oligonucleotides can be efficiently generated by 
light-directed synthesis and can be used to determine the 
identity of DNA target sequences. Because combinatorial 

10 strategies are used, the number of compounds increases 
exponentially while the number of chemical coupling cycles 
increases only linearly. For example, expanding the synthe- 
sis to the complete set of 4^ (65,536) octanucleotides will 
add only four hours to the synthesis for the 16 additional 

15 cycles. Furthermore, combinatorial synthesis strategies can 
be implemented to generate arrays of any desired composi- 
tion. For example, because the entire set of dodecamers (4^^) 
can be produced in 48 photolysis and coupling cycles (b" 
compounds requires bxn cycles), any subset of the dodecam- 

20 ers (including any subset of shorter oligonucleotides) can be 
constructed with the correct lithographic mask design in 48 
or fewer chemical coupling steps. In addition, the number of 
compounds in an array is limited only by the density of 
synthesis sites and the overall array size. Recent experi- 

25 ments have demonstrated hybridization to probes synthe- 
sized in 25 ;<m sites. At this resolution, the entire set of 
65,536 octanucleotides can be placed in an array measuring 
0.64 cm square, and the set of 1,048,576 dodecanucleotides 
requires only a 2.56 cm array. 

30 Genome sequencing projects will ultimately be limited by 
DNA sequencing technologies. Current sequencing method- 
ologies are highly reliant on complex procedures and require 
substantial manual effort. Sequencing by hybridization has 
the potential for transforming many of the manual efforts 

35 into more efficient and automated formats. Light-directed 
synthesis is an efificient means for large scale production of 
miniaturized arrays for SBH. The oligonucleotide arrays are 
not limited to primary sequencing applications. Because 
single base changes cause multiple changes in the hybrid- 

40 ization pattern, the oligonucleotide arrays provide a power- 
ful means to check the accuracy of previously elucidated 
DNA sequence, or to scan for changes within a sequence. In 
the case of octanucleotides, a single base change in the target 
DNA results in the loss of eight complements, and generates 

45 eight new complements. Matching of hybridization patterns 
may be useful in resolving sequencing ambiguities from 
standard gel techniques, or for rapidly detecting DNA muta- 
tional events. The potentially very high information content 
of light-directed oligonucleotide arrays will change genetic 

50 diagnostic testing. Sequence comparisons of hundreds to 
thousands of different genes will be assayed simultaneously 
instead of the current one, or few at a time format. Custom 
arrays can also be constructed to contain genetic markers for 
the rapid identification of a wide variety of pathogenic 

55 organisms. 

Oligonucleotide arrays can also be applied to study the 
sequence specificity of RNA or protein-DNA interactions. 
Experiments can be designed to elucidate specificity rules of 
non Watson-Crick oligonucleotide structures or to investi- 

60 gate the use of novel synthetic nucleoside analogs for 
antisense or triple helix applications. Suitably protected 
RNA monomers may be employed for RNA synthesis. The 
oligonucleotide arrays should find broad application deduc- 
ing the thermodynamic and kinetic rules governing forma- 

65 tion and stability of oligonucleotide complexes. 

Other than the use of photoremovable protecting groups, 
the nucleoside coupling chemistry is very similar to that 
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used routinely today for oligonucleotide synthesis. FIG. 30 
shows the deprotection, coupling, and oxidation steps of a 
solid phase DNA synthesis method. FIG. 31 shows an 
illustrative synthesis route for the nucleoside building blocks 
used in the method. FIG. 32 shows a preferred photoremov- 5 
able protecting group, MeNPOC, and how to prepare the 
group in active form. The procedures described below show 
how to prepare these reagents. The nucleoside building 
blocks are 5'-MeNPOC-THYMIDINE-3'-OCEP; 
5'-MeNP0C-N'*-t-BUTYL PHENOXYACETYL- 
DE0XYCYTIDINE-3'-0CEP; 5'-MeNP0C-N'*-t-BUTYL 
PHEN0XYACETYL-DE0XYGUAN0SINE-3'-0CEP; 
and 5'-MeNP0C-N'*-t-BUTYL PHENOXYACETYL- 
DE0XYADEN0SINE-3'-0CEP. 

A. Prieparation of 4, 5-methylencdioxy-2-nitroac6tophenone jj 
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minimum volume of CHjClj or THF(-175 ml) and then 
precipitating it by slowly adding hexane (1000 ml) while 
stirring (yield 51 g; 80% overall). It can also be recrystal- 
lized (eg., tolucne-hexane), but this reduces the yield. 
C. Preparation of l-(4,5- methylenedioxy-2-nitrophenyl) 
ethyl chloroformatc (McNPOC-Cl) 



COQj 
Tolucnc/THF 





A solution of 50 g (0.305 mole) 3,4- 25 
methylenedioxyacetophenone (Aldrich) in 200 mL glacial 
acetic acid was added dropwise over 30 minutes to 700 mL 
of cold (2-4'' C.) 70% HNO3 with stirring (NOTE: the 
reaction will overheat without external cooling from an ice 
bath, which can be dangerous and lead to side products). At 30 
temperatures below 0** C, however, the reaction can be 
sluggish. A temperature of 3°-5'' C. seems to be optimal). 
The mixture was left stirring for another 60 minutes at 3°-5° 
C, and then allowed to approach ambient temperature. 
Analysis by TLC (25% EtOAc in hexane) indicated com- 
plete conversion of the starting material within 1-2 hr. When 
the reaction was complete, the mixture was poured into ~3 
liters of crushed ice, and the resulting yellow solid was 
filtered off, washed with water and then suction-dried. Yield 
-53 g (84%), used without further purification. 
B. Preparation of l-(4,5-Methylenedioxy-2-nitrophenyl) 
ethanol 
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Phosgene (500 mLof 20% wA' in toluene fi-om Fluka; 965 
mmole; 4 eq.) was added slowly to a cold, stirring solution 
of 50 g (237 mmole; 1 eq.) of l-(4,5-methylenedioxy-2- 
nitrophenyl)ethanol in 400 mL dry THF. The solution was 
stirred overnight at ambient temperature at which point TLC 
(20% EtjO/hexane) indicated >95% conversion. The mix- 
ture was evaporated (an oil-less pump with downstream 
aqueous NaOH trap is recommended to remove the excess 
phosgene) to afford a viscous brown oil. Purification was 
effected by flash chromatography on a short (9x13 cm) 
column of silica gel eluted with 20% Et20^exane. Typically 
55 g (85%) of the solid yeUow MeNPOC-Cl is obtained by 
this procedure. The crude material has also been recrystal- 
lized in 2-3 crops from 1:1 ether/hexane. On this scale, -100 
ml is used for the first crop, with a few percent THF added 
to aid dissolution, and then cooling overnight at -20** C. (this 
procedure has not been optimized). The product should be 
stored dessicated at -20* C. 

D. Synthesis of 5'-MeNPOC-2'-DEOXYNUCLEOSIDE-3'- 
(N,N-DIISOPROPYL 2-CYANOETHYL PHOSPHORA- 
MIDITES 
(1) S'-MeNfPOC-Nucleosides 



V-0 



Sodium borohydride (10 g; 0.27 mol) was added slowly 
to a cold, stirring suspension of 53 g (0.25 mol) of 4,5- 
methylenedioxy-2-nitroacetophenone in 400 mL methanol. 
The temperature was kept below 10° C. by slow addition of 
the NaBH4 and external cooling with an ice bath. Stirring 
was continued at ambient temperature for another two hours, 
at which time TLC (CHjCy indicated complete conversion 
of the ketone. The mixture was poured into one liter of 
ice-water and the resulting suspension was neutralized with 
ammoniiun chloride and then extracted three times with 400 
mL CHjClj or EtOAc (the product can be collected by 
filtration and washed at this point, but it is somewhat soluble 
in water and this results in a yield of only -60%). The 
combined organic extracts were washed with brine, then 
dried with MgS04 and evaporated. The crude product was 
purified from the main byproduct by dissolving it in a 
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Base 



Base=THYMIDINE (T); N-4-IS0BUTYRYL 
60 2'-DE0XYCYTIDINE (ibu-dC); N-2-PHENOXYACETYL 
2'DEOXYGUANOSINE (PAC-dG); and N-6- 
PHENOXYACETYL 2'DEOXYADENOSINE (PAC-dA) 

All four of the 5 '-MeNPOC nucleosides were prepared 
from the base-protected 2'-deoxynucleosides by the follow- 
65 ing procedure. The protected 2*-deoxynucleoside (90 
mmole) was dried by co-evaporating twice with 250 mL 
anhydrous pyridine. The nucleoside was then dissolved in 
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300 mL anhydrous pyridine (or 1:1 pyridine/DMF, for the For products in the 200 to 1000 bp size range, check 2 ^\ 

dG^^ nucleoside) under argon and cooled to -2° C. in an of the reaction on a 1.5% O.SxTBE agarose gel using an 

ice bath. A solution of 24.6 g (90 mmole) MeNPOC-Cl in appropriate size standard (phiX174 cut with Haelll is 

100 mL dry THP was then added with stirring over 30 convenient). The PGR reaction should yield several pico- 
minutes. The ice bath was removed, and the solution allowed 5 moles of product. It is helpful to include a negative control 

to stir overnight at room temperature (TLC: 5-10% MeOH (i.e., M TE instead of genomic DNA) to check for possible 

in CHjClj; two diastereomers). After evaporating the sol- contamination. To avoid contamination, keep PGR products 

vents under vacuum, the crude material was taken up in 250 from previous experiments away frorii later reactions, using 

mL ethyl acetate and extracted with saturated aqueous filter tips as appropriate. Using a set of working solutions 

NaHGOj and brine. The organic phase was then dried over and storing master solutions separately is helpful, so long as 

NajS04, filtered and evaporated to obtain a yeUow foam. one does not contaminate the master stock solutions. 

Hie crude products were finally purified by flash chroma- .^i^p « amplifications of short fragnients from 

tography (9x30 cm siUca gel column eluted with a stepped ^^^^ I^NA it is, in general, unnecessary to optmiize 

gradient of 2%-6% MeOH in CH.Cy . Yields of the pun- ^g^* concentrations^A good pro^^ 

I . , . . ^ • ♦u f n^cf make a master mix minus enzyme; dispense the genomic 

fied diastereomenc^^^^ ^5 ^ ,^ ,^ j^^^^^^l ^^^^ ^g^. 

DIISOPROPYL 2-CYANOETHYL soimion ,0 each well, using a new filter tip each time. 

PHOSPHORAMIDITES) 2) PURIFICAnON 

Removal of unincorporated nucleotides and primers from 
^g^^ > 20 PGR samples can be accomplished using the Promega 

Magic PGR Preps DNA purification kit. One can purify the 
whole sample, following the instructions supplied with the 
kit (proceed from section IIIB, 'Sample preparation for 
direct purification from PGR reactions'). After elution of the 
^ Base 25 PGR product in 50 fil of TE or HjO, one centrifuges the 

MenpocO >^ ^ eluate for 20 sec at 12,000 rpm in a microfuge and carefully 

transfers 45 to a new microfuge tube, avoiding any visible 
pellet. Resin is sometimes carried over during the elution 
step. This transfer prevents accidental contamination of the 
P— 0CH:CH:CN 30 linear amplification reaction with 'Magic PGR' resin. Other 
/ methods, e.g. size exclusion chromatography, may also be 

V^^v^ used. 
1 I 3) UNEAR AMPUFICAnON 

In a 0.2 mL thin-wall PGR tube mix: 4 ft\ purified PGR 
The four dcoxynucleosides were phosphitylated using 35 product; 2 /^l primer (10 pmol//il); 4 /^l lOxPGR buffer; 4 /<1 
either 2-cyanoethyl-N,N-diisopropyl dNTPs(2mMdA,dG,dG,0.1 mMdT);4;^l0.1 mMdUTP; 

chlorophosphoramidite, or 2-cyanoethyl-N,N,N',N'- 1 /il 1 mM fluorescein dUTP (Amersham RPN 2121); 1 U 
tetraisopropylphosphorodiamidite. The following is a typi- Taq polymerase (Perkin Ehner, 5 U//fl); and add HjO to 40 
cal procedure. Add 16.6, g (17.4 ml; 55 mmole) of /^l. Gonduct 40 cycles (92' G. 30 sec, 55" C. 30 sec, 72** C. 
2-cyanoethyl-N,N,N',N'-tetraisopropylphosphorodiamidite 40 90 sec) of PGR. These conditions have been used to amplify 
to a solution of 50 mmole 5'-MeNPOC-nucleoside and 4.3 a 300 nucleotide mitochondrial DNA fragment but are 
g (25 mmole) diisopropylammonium letrazolide in 250 mL generally applicable. Even in the absence of a visible 
dry GH2GI2 under argon at ambient temperature. Gontinue product band on an agarose gel, there should still be enough 
stirring for 4-16 hours (reaction monitored by TLG: product to give an easily detectable hybridization signal. If 
45:45:10 hexane/GHjaj/EtaN). Wash the. organic phase 45 one is not treating the DNA with uracil DNA glycosylase 
with saturated aqueous NaHGOg and brine, then dry over (see Section 4), dUTP can be omitted from the reaction. 
Na2S04, and evaporate to dryness. Purify the crude amidite 4) FRAGMENTATION 

by flash chromatography (9x25 cm silica gel column eluted Purify the linear amplification product using the Promega 
with hexane/GHjGlj/TEA -45:45:10 for A, G, T; or 0:90:10 Magic PGR Preps DNA purification kit, as per Section 2 
for G). The yield of purified amidite is about 90%. 50 above. In a 0.2 mL thin-wall PGR tube mix; 40 /4 purified 

II. PREPARATION OF LABELED DNA/ labeled DNA; 4 /d lOxPGR buffer; and 0.5 //I uracilDNA 
HYBRIDIZAnON TO ARRAY glycosylase (BRL lU/jul). Incubate the mixture 15 min at 

1) PGR 37* C., then 10 min at 97" G.; store at -20° C. until ready 

PGR amplification reactions are typically conducted in a to use. 
mixture composed of per reaction: 1 fi\ genomic DNA; 10//1 55 5) HYBRlDIZAnON SGANNING & STRIPPING 
each primer (10 pmol///! stocks); 10/^1 lOxPGR buffer (100 A blank scan of the slide in hybridization buffer only is 
mM Tris.a pH8.5, 500 mM KGl, 15 mM MgGlj); 10 fd 2 helpful to check that the sUde is ready for use. The buffer is 
mM dNTPs (made from 100 mM dNTP stocks); 2.5 U Taq removed from the flow- cell and replaced with 1 mL of 
polymerase (Perkin Elmer AmpliTaq™, 5 U//d); and H^O to (fragmented) DNA in hybridization buffer and mixed well. 
100 fil The cycling conditions are usually 40 cycles (94" C. 60 The scan is performed in the presence of the labeled target. 
45 sec, 55" C. 30 sec, 72" G. 60 sec) but may need to be FIG. 33 illustrates an illustrative detection system for scan- 
varied considerably from sample type to sample type. These ning a DNA chip. A series of scans at 30 min intervals using 
conditions are for 0.2 mL thin wall tubes in a Perkin Elmer a hybridization temperature of 25" G. yields a very clear 
9600 thermocycler. See Perkin Elmer 1992/93 catalogue for signal, usually in at least 30 min to two hours, but it may be 
9600 cycle time information. Target, primer length and 65 desirable to hybridize longer, i.e., overnight. Using a laser 
sequence composition, among other factors, may also affect power of 50 //W and 50 fim pixels, one should obtain 
parameters. maximum counts in the range of hundreds to low thousands/ 



5,837,832 

33 34 

pixel for a new slide. When finished, the slide can be 30 sec) are performed, but cycling conditions may need to 

stripped using 50% formamide. rinsing well in deionized be varied, liiese conditions are for 0.2 mL thin wall tubes in 

HjO, blowing dry, and storing at room temperature. Perkin Elmer 9600. For products in the 200 to 1000 bp size 

III. PREPARATION OF LABELED RNA range, check 2/<l of the reaction on a 1.5% O.SxTDE agarose 

/HYBRIDIZATION TO ARRAY 5 gel using an appropriate size standard. For larger or smaller 

1) TAGGED PRIMERS volumes (20-100 /<1), one can use the same amount of 
The primers used to amplify the target nucleic acid should genomic DNA but adjust the other ingredients accordingly. 

have promoter sequences if one desires to produce RNA 4) IN VITRO TRANSCRIPTION 

from the amplified nucleic acid. Suitable promoter Mix: 3 PGR product; 4 /il 5xbuffer; 2 ;il DTT; 2.4 /il 10 

sequences are shown below and include: lO mM rNTPs (100 mM solutions from Pharmacia); 0.48 fit 10 

(1) the T3 promoter sequence: mM fluorescein-UTP (Fluorescein-12-UTT, 10 mM 
5'-CGGAAITAACCCTCACrAAAGG (SEQ. ID NO:298) solution, from Boehringer Mannheim); 0.5 fd RNA poly- 
5'-AArTAACCCTCACTAAAGGGAG; (SEQ. ID NO;299) merase (Promega T3 or T7 RNA polymerase); and add HjO 

(2) the T7 promoter sequence: to 20 //I. Incubate at 37** C. for 3 h. Check 2 ;d of the reaction 
5* TAATACGACrCACTATAGGGAG; (SEQ. ID NO:300) 15 on a 1.5% O.SxTBE agarose gel using a size standard, 
and (3) the SP6 promoter sequence: 5xbuffer is 200 mM Tris pH 7.5, 30 mM MgCl^, 10 mM 
5* AmAGGTGACACTATAGAA. (SEQ. ID NO:301) spermidine, 50 mM NaCl, and 100 mM DTT (supplied with 
The desired promoter sequence is added to the 5* end of the enzyme). The PGR product needs no purification and can be 
PGR primer. It is convenient to add a different promoter to added directly to the transcription mixture. A 20 ft\ reaction 
each primer of a PGR primer pair so that either strand may 20 is suggested for an initial test experiment and hybridization; 
be transcribed from a single PGR product. a 100 pi\ reaction is considered "pr^arative" scale (the 

Synthesize PGR primers so as to leave the DMT group on. reaction can be scaled up to obtain more target). The amount 

DMT-on purification is unnecessary for PGR but appears to of PGR product to add is variable; typically a PGR reaction 

be important for transcription. Add 25 /d 0.5M NaOH to will yield several picomoles of DNA. If the PGR reaction 

collection vial prior to collection of oligonucleotide to keep 25 does not produce that much target, then one should increase 

the DMT group on. Deprotect using standard chemistry — the amount of DNA added to the transcription reaction (as, 

55® G. overnight is convenient. well as optimize the PGR). The ratio of fluorescein-UTP to 

HPLG purification is accomplished by drying down the UTP suggested above is 1:5, but ratios from 1:3 to 1:10 — all 

oligonucleotides, resuspending in 1 mL 0.1 M TEAA (dilute work well. One can also label with biotin-UTP and detect 

2.0M stock in deionized water, filter through 0.2 micron 30 with streptavidin-FITG to obtain similar results as with 

filter) and filter through 0.2 micron filter. Load 0.5 mL on fluorescein-UTP detection. 

reverse phase HPLG (column can be a Hamilton PRP-1 For nondenaturing agarose gel electrophoresis of RNA, 

semi-prep, #79426). The gradient is 0-*50% GHjCN over note that the RNA band will normally migrate somewhat 

25 min (program 0.2 /miol.prep.0-50, 25 min). Pool the faster than the DNA template band, although sometimes the 

desired fractions, dry down, resuspend in 200 80% HAc. 35 two bands will comigrate. The temperature of the gel can 

30 min RT. Add 200 fd EtOH; dry down. Resuspend in 200 effect the migration of the RNA band. The RNA produced 

//I HjO, plus 20 /il NaAc pH5.5, 600 /A EtOH. Leave 10 min from in vitro transcription is quite stable and can be stored 

on ice; centrifuge 12,000 rpm for 10 min in microfuge. Pour for months (at least) at -20** G. without any evidence of 

off supernatant. Rinse pellet with 1 mLEtOH, dry, resuspend degradation. It can be stored in unsterilized 6xSSPE 0.1% 

in 200 ft\ H20. Dry, resuspend in 200 /il TE. Measure A260, 40 triton X- 100 at -20" G. for days (at least) and reused twice 

prepare a 10 pmol/wl solution in TE (10 mM Tris.Cl pH 8.0, (at least) for hybridization, without taking any special pre- 

0.1 mM EDTA). Following HPLG purification of a 42 mer, cautions in preparation or during use, RNase contamination 

a yield in the vicinity of 15 nmol from a 0.2 //mol scale should of course be avoided. When extracting RNA from 

synthesis is typical. cells, it is preferable to work very rapidly and to use strongly 

2) GENOMIC DNA PREPARATION 45 denaturing conditions. Avoid using glassware previously 
For obtaining genomic DNA from human hair, one can contaminated with RNases. Use of new disposable plas- 

extract as few as 5 hairs, including hair roots. On a clean and ticware (not necessarily sterilized) is preferred, as new 

sterile surface, one places the hair on a piece of parafilm, and plastic tubes, tips, etc., are essentially RNase free. Treatment 

after wiping a new razor blade with EtOH cutting off the with DEPG or autoclaving is typically not unnecessary, 

roots, the roots are transferred to a 1.5 mL microfiige tube 50 5) FRAGMENTATION 

using a pair of Millipore forceps cleaned with EtOH. Add In a 0.2 mL thin-wall PGR tube mix: 18 fi\ RNA (direct 

500 /d (10 mM Tris.a pH8,0, 10 mM EDTA, 100 mM NaQ, from transcription reaction— no purification required); 18 

2% (w/v) SDS, 40 mM DTT, filter steriHzed) to the sample. HjO; and 4 jul IM Tris.a pH9.0. Incubate at 99.9** G. for 60 

Add 1.25 ^1 20 mg/ml proteinase K (Boehringer) Incubate at min. Add to 1 mL hybridization buffer and store at -20® G. 

55® G. for 2 hours, vortexing once or twice. Perform 2x0.5 55 until ready to use. The alkaline hydrolysis step is very 

mL 1:1 phenohCHClg extractions. After each extraction, reliable, llie hydrolysed target can be stored at -20® G, in 

centrifuge 12,000 rpm 5 min in a microfiige and recover 0.4 6xSSPE/0.1% Triton X-100 for at least several days prior to 

mL supernatant. Add 35 NaAc pH5.2 plus 1 mL EtOH. use and can also be reused. 

Place sample on ice 45 min; then centrifuge 12,000 rpm 30 6) HYBRIDIZAHON SCANNING, & STRIPPING 

min, rinse, air dry 30 min, and resuspend in 100 /il TE. 60 A blank scan of the slide in hybridization buffer only is 

3) PGR helpful to check that the slide is ready for use. The buffer is 
PGR is performed in a mixture containing, per reaction: 1 removed from the flow cell and replaced with 1 mL of 

^ genomic DNA; 4 ^\ each primer (10 pmol//il stocks); 4 fA (hydrolysed) RNA in hybridization buffer and mixed well. 

10 xPGR buffer (100 mM Tris.a pH8.5, 500 mM KGl, 15 Incubate for 15-30 min at 18® G. Remove the hybridization 

mM MgCIj); 4 /*1 2 mM dNTPs (made from 100 mM dNTP 65 solution, which can be saved for subsequent experiments, 

stocks); 1 U Taq polymerase (Perkin Ehner, 5 U/jul); HjO to Rinse the flow cell 4-5 times with fresh changes of 6xSSPE/ 

AOfil About 40 cycles (94® C. 30 sec, 55® G. 30 sec, 72® G. 0.1% Triton X-100, equilibrated to 18® C. The rinses can be 
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performed rapidly, but it is important to empty the flow cell 
before each new rinse and to mix the liquid in the cell 
thoroughly. The scan is performed in the presence of the 
labeled target. A series of scans at 30 min intervals using a 
hybridization temperature of 25° C. yields a very clear 5 
signal, usually in at least 30 min to two hours, but it may be 
desirable to hybridize longer, i.e., overaight. Using a laser 
power of 50 /<W and 50 /im pixels, one should obtain 
maximum counts in the range of hundreds to low thousands/ 
pixel for a new slide. When finished, the slide can be lO 
stripped using 50% to 100% formamide at 50° C. for 30 min, 
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rinsing well in deionized HjO, blowing dry, and storing at 
room temperature. 

These conditions are illustrative and assume a probe 
length of -15 nucleotides. The stripping conditions sug- 
gested are fairly severe, but some signal may remain on the 
slide if the washing is not stringent. Nevertheless, the counts 
remaining after the wash should be very low in comparison 
to the signal in presence of target RNA. In some cases, much 
gentler stripping conditions are effective. The lower the 
hybridization temperature and the longer the duration of 
hybridization, the more difficult it is to strip the slide. Longer 
targets may be more difficult to strip than shorter targets. 



SEQUENCE USTING 



( 1 ) GENERAL INFORMAHON: 

( i i i )^^U^^BER OF SEQUENCES: 360 



( 2 ) INFORMAnON FOR SEQ ID N0:1: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGFH: L5 base pain 
( B ) TYPE: miclcic acid 
( C ) SfRANDEDNESS: tia^e 
( D ) TOPOLOGY: linear 

{ i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIFnON: SEQ ID N0:1: 

TTGCTGACGT CAGCC IS 



( 2 ) INFORMAnON FOR SEQ ID N0:2: 

( i ) SEQUENCE CHARACTERlSmcS: 
( A ) LENGTH: 15 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS; single 
C D ) TOPOLOGY: linear 

( i i )MOLECUIJE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:2: 



< 2 ) INFORMAnON FOR SEQ © N0:3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pain 
( B ) TYPE: miclcic acid 
( C ) STRANDEDNESS: glngtc 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

TTGCTGACCT CAGCC 



( 2 ) INFORMAnON FOR SEQ ID N0:4: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pira 
( B ) TYPE: andcic acid 
( C ) SFRANDEDNESS: single 
( D ) TOPOLOGY: linear 



( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
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TTGCTGACTT CAGCC 

( 2 ) INFORMAnON FOR SEQ [D N0:5: 

( i ) SEQUENCE CHARACTERrsnCS: 
( A ) LENCTH: 39 base pairs 
( B ) TYTE nucleic Dcid 
( C ) STRANDEDNESS: siaglc 
( D ) TOPOLOGY: (hear 

( i I ) MOLECULE TYPE: DKA (oligonucleotide) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

CATTAAAGAA AATATCATCT TTGGTGTTTC CTATGATGA 

{ 2 ) INFORMAnON FOR SEQ ID N0:6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base paira 
( B ) TYPE: nucleic Bcid 
( C ) STRANDEDNESS: single 
( D )T0P0L0GY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

CATTAAAGAA AATATCATTG GTGTTTCCTA TGATGA 

( 2 ) INFORMATION FOR SEQ ID N0:7: 

. ( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base paira 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( X i ) SEQUENCE DESCRIFOON: SEQ ID N0:7: 

CATTAAAGAA AATATCATTG GTGTTTCCTA TGATGA 

{ 2 ) INFORMATION FOR SEQ ID NO:a: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 15 base pain 
( B ) TVPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY: linear 

( 1 i ) MOLECULE TVPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:8: 

AACACCAATG ATGAT 



( 2 ) INFORMAnON FOR SEQ ID N0:9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH:t5ba»epIrs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: stable 
(D)TOPOLOGY: linear 

( i I ) MOLECULE TVTE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ 0) N0:9: 

CCAAAGATNA TATTT 



( 2 ) INFORMAnON FOR SEQ ID NO:10: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCTH: 15 base pain 
( B ) TYPE: nucleic acid 
( C ) SmiANDEDNESS: single 
{ D)TOPOLOGY; lineal 

( i i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIPTTON: SEQ ID NO:10: 

ACCAAACANG ATATT 

( 2 ) [NFORMAnON FOR SEQ ID NO;ll: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENCTTH: 15 base pairs 
( B ) TYPE: niwleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

CACCAAAGNT GATAT 



( 2 ) INFORMATION FOR SEQ ID N0:12: 

( i ) SEQUENCE CHARACreRISnCS: 
( A ) LENCTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY: Itncai 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFnON: SEQ ID N0:12: 

ACACCAAANA TGATA 



( 2 ) INFORMAnON FOR SEQ ID N0:13: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: IS base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgic 
(D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:13: 

AACACCAANG ATGAT 15 



( 2 ) INFORMAnON FOR SEQ ID N0:14: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B )TYP& nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOCY: linear 

( i i )M0LECUIE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

AAACACCANA GATGA * 15 



( 2 ) lOTORMAnON FOR SEQ ID N0:15: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: IS base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: stagte 
( D ) TOPOLOGY; linear 



41 



5,837,832 

-continued 



42 



C i i ) MOLECULE TYPE: DNA (probe) 
C X i ) SEQUENCE DESCRIPTION; SEQ \D N0:15: 
GAAACACCNA AGATG 

( 2 ) INFORMAnON FOR SEQ ID N0:16: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH:15basepairs 
( B)TYPE: nucleic add . 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFRON: SEQ ID N0:16: 

GGAAACACNA AAGAT 

( 2 ) INFORMAnON FOR SEQ ID NO: 17: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS; single 
( D ) TOPOLOGY; linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

AGGAAACANC AAAGA 

( 2 ) tNFORMAnON FOR SEQ ID N0:18: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 2t base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

CCTTCAGAGG GTAAAATTAA G 

( 2 ) INFORMAnON FOR SEQ ID N0:19: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )T0P0LOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:19: 

CCTTCAGAGT GTAAAATTAA G 



( 2 ) INFORMAnON FOR SEQ ID NO;20: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 44 base pairs 
( B )TirPE: nucleic acid 
C C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
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TAATACGACT CACTATAGGC AGATGACCTA ATAATGATGG GTTT 44 



( 2 ) INFORMAnON.FOR SEQ ID N0:21: 

, ( i ) SEQUENCE CHAHACTERISTtCS: 
( A )LENGTH:43bMepairs 
( B ) TYPE nucleic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOCY: linear 

( i i ) MOLECULE TYPE- DNA (probe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

TAATACGACT CACTATAGGC AGTAGTGTGA AGGGTTCATA TOC 43 



( 2 ) INFORMAnON FOR SEQ ID N0:22: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 45 base pairs 
( B ) TYPE nucleic acid 
( C ) STRANDEDNESS: single 
( D)T0P0L0GY: linear 

( i i )MOLECUUE TYPE DNA (probe) 

( 3t i ) SEQUENCE DESCRIFnON: SEQ ID NO:22: 

CTCGGAATTA ACCCTCACTA AAGGTAGTGT GAAGGGTTCA TATGC 45 



( 2 ) INFORMAnON FOR SEQ ID NO:23: 

( i ) SEQUENCE CHARACmiSnCS: 
( A ) LENGTH: 43 base pairs 
( B ) TVPE nucleic acid 
< C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( i I ) MOLECULE TYPE DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

TAATACGACT CACTATAGGC AGAGCATACT AAAAGTGACT CTC 43 



( 2 ) INFORMATION FOR SEQ ID NO:24: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 44 base pairs 
( B ) TYPE nucleic add 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY: linear 

C i 1 ) MOLECULE TVPE DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

TAATACGACT CACTATAGGC AGACATGAAT GACATTTAC A . GC AA 44 



( 2 ) INFORMAnON FOR SEQ ID N0:2S: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH' 44 base pairs 
( B )'mE nucleic acid 
( C ) STRANDEDNESS: single 
(D)TOPOLOGY: linear 

( i i ) MOLECULE TVPE DNA (probe) 

( K I ) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

CGGAATTAAC CCTCACTAAA GGACATGAAT GACATTTACA GCAA 44 



( 2 ) INFORMAnON FOR SEQ ID NO:26: 
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( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENCTTH: 12 base pairs 
( B ) TYPE: cDclcic acid 
( C ) STOANDEDNESS: aingle 
{ D )T0P01.0GY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPHON: SEQ ID N0:2& 

TTTATGOOGT OA 



( 2 ) INFORMAnON FOR SEQ ID NO:27: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base paira 
( B ) TYPE: nucleic acid 
( C ) ynUNDEDNESS: single 
< D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

TTGATTTATG GG 



( 2 ) INFORMAnON FOR SEQ ID NO:28: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base paira 
( B ) TYPE: nucleic acid 
( C ) JTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

f X i ) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

AACCTATTTG ATT 



( 2 ) INFORMAnON FOR SEQ ID NO:29: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS; single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GCACCAAACC TA 



( 2 ) INFORMAnON FOR SEQ ID NO:30: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pair* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sbgle 
( D ) TOPOLOGY linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:30: 

AGGCTAGGAC CA 



( 2 ) INFORMAnON FOR SEQ ID N0:31: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pai» 
( B ) rVPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DXA (probe) 
( X i ) SEQUENCE DESCRimON: SEQ ID N0:31: 
GGTGTOTOTG TGC 

( 2 ) INFORMAnON FOR SEQ ID NO:32: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 14 base pain 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

CGGTGTGTGT GTGC 

( 2 ) INFORMAnON FOR SEQ ID NO:33: 

( i ) SEQUENCE CHARACTERISTICS: , 
( A ) LENGTH: U base paira 
{ B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D )TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:33: 

GGT.GTGTGTG TGCT 

( 2 ) INFORMAnON FOR SEQ ID NO:34: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH: 12 base pairs 
( B ) TYPE nucleic acid 
( C ) HANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (Fobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

CTGGGTAGGA TG 

( 2 ) INFORMAnON FOR SEQ ID NO:35: 

( ! ) SEQUENCE CHARACreRISnCS: 
( A ) LENGTH; 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) HANDEDNESS: single 
(D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:35: 

TGCTGGGTAG GA 



( 2 ) INFORMAnON FOR SEQ ID N0:3fi: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 

( 1 i ) MOLECULE TYPE- DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
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TGTGCTGGGT AG 

( 2 ) INFORMAnON FOR SEQ ID NO;37: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: naclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY; lineai 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

GTTAOCAGCG GT 

( 2 ) INFORMAnON FOR SEQ ID NO:38: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) yrRANDEDNESS: smglc 
( D ) TOPOLOGY; Ibeai 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTTON: SEQ ID NO:38: 

GGGTTAGCAG CG 

( 2 ) INFORMATION FOR SEQ ID NO:39: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

AGCGGGGGAO G 

( 2 ) INFORMATION FOR SEQ ID N0:40: 

( i ) SEQUENCE CHARACTERISTICS: 
( A )LEN(7rH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)T0POLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

AGCGGGGGAG 



( 2 ) INFORMATION FOR SEQ ID N0;41: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY; linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

GGTTGGTTCG G 



( 2 ) INFpRMAnON FOR SEQ ID NO:42; 
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C i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C )CTRANDEDNESS: sin^c 
( D ) TOPOLOGY: Imcai 

( i . i ) MOLECULE TYPE: DNA (probe) 

C X 1) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

GGGTTTGGTT GO 

( 2 ) INFORMAnON FOR SEQ ID NO:43: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (prabe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

GATCTTTGGG GT 

( 2 ) INFORMAHON FOR SEQ ID NO:44: 

( i ) SEQUENCE CHARACIERrsnCS: 
( A ) LENCTTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

GGGTGATCTT TG 

( 2 ) INFORMAnON FOR SEQ ID NO:45: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

TGTGGGGGGT GA 

( 2 ) INFORMAnON FOR SEQ ID N0:4<i: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCTTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

TAAACTGTGG GG 



( 2 ) INFORMAnON FOR SEQ ID NO:47: 

( i ) SEQUENCE CHARACTERISncS: 
( A ) LEN'GTH: 13 base pain 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNACprobc) 
(x I ) SEQUENCE DESCRIFnON: SEQ ID NO:47: 
GCTACATAAA CTG 



( 2 ) INFORMAnON FOR SEQ © NO:48: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE: nucleic odd 
( C ) STBANDEDNESS: sbgle 
( D ) TOPOLOGY: Ibcai 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

GAGGTAAGCT ACA 



( 2 ) INFORMAnON FOR SEQ ID NO:49: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 12 base paiis 
( B )TYPE: nucleic acid 
( C ) SrraANDEDNESS: single 
( D ) TOPOLOGY: lincai 

( i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

GAGGAGGTAA GC 



( 2 ) INFORMAnON FOR SEQ ID NO:50:. 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

TGCTTTGAGG AG 



( 2 ) INFORMAnON FOR SEQ D) N0:S1: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )T0P0LOGY:lmear 

( i j ) MOLECULE TYPE: DNA (Fobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:51: 

AGTGTATTGC TTT 



( 2 ) INFORMAnON FOR SEQ ID NO:52: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 



( X ! ) SEQUENCE DESCRIFnON: SEQ U> NO:52: 
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CATTTTCAGT GTA 

( 2 ) INFORMAnON FOR SEQ ID NO:53: 

( i ) SEQUENCE CHARACTERtSmCS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE: nndcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C I I ) MOLECULE TYPE; DNA (probe) 

C jt i ) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

TAAACATTTT CAG 

C 2 ) INFORMAnON FOR SEQ ID NO:54: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: linear 

C I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

AGCCCGTCTA AA 

( 2 ) INFORMATION FOR SEQ ID NO:55: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LEN'OTH; 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TfPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

GAGCCCGTCT AA 

( 2 ) INFORMAnON FOR SEQ ID NO:56: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCTH: 12 base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TirPE: DNA (ptobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

TGATGTGAGC CC 



( 2 ) INFORMAnON FOR SEQ ID NO:57; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE; nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (pobe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

GGGGTOATCT GA 



( 2 ) INFORMAnON FOR SEQ ID N0:S8: 
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( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TW^ nucleic acid 
( C ) SniANDEDNESS: single 
( D )T0P0LCM3Y: linear 

(. i i ) MOLECULE TVPE: DNA (probe) 

(x \ ) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

GAGTGGCAGO G 



( 2 ) INFORMAnON FOR SEQ ID NO:59: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TVTE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

GTATGGGAGT GG 



( 2 ) INFORMAnON FOR SEQ ID NO:60: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LEN'GTH: 14 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

GATTAGTAGT ATGG 



( 2 ) INFORMAnON FOR SEQ ID N0:61: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) JY?E: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i )MOLECULETYPE- DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID N0:61: 

TGAATGAGAT TAG 



( 2 ) INFORMAnON FOR SEQ ID NO:62: 

( i ) SEQUENCE CHARACIERISnCS: 
( A)LENGTH: 13 base pairs 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY linear 

( i i ) MOLECULE TVTE; DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

ATTGAATGAG ATT 



( 2 ) INFORMAnON FOR SEQ ID NO:63: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE: nucleic acid 
( C ) STOANDEDNESS: shgle 
( D ) TOPOLOGY: linear 



59 



5,837,832 

-continued 



60 



( i i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ED NO:63: 
GGGTTGTATT GAA 

( 2 ) INFORMAnON FOR SEQ U> NO:64: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) HANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( 3t i ) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

GCGGGGGTTG 

( 2 ) INFORMAnON FOR SEQ ID NO:65: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH- 10 base pairs 
( B ) TYPE: nucleic acid 
{ C ) SmtANDEDNESS: single 
( D) TOPOLOGY linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:6S; 

ATGGGCGGGG 

( 2 ) INFORMAnON FOR SEQ D N0:66: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (iprobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

TAGGATGGGC G 



( 2 ) INFORMAnON FOR SEQ ID NO:67: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIFnON: SEQ ID NO:67: 

TGGOTAGGAT GG 



( 2 ) INFORMAnON FOR SEQ ID NO:68: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOCY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 



( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
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GTGCTOGCTA GO 

( 2 ) INFORMAHON FOR SEQ ID N0:(i9: 

( i ) SEQUENCE CHARACTERISTiCS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY; linear 

( i i ) MOLECULE TYPE; DNA (probe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

TGTGTGTGCT GO 

( 2 ) INFORMATION FOR SEQ ID NO:70: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:70: 

GCGGTGTGTG TO 

( 2 ) INFORMATION FOR SEQ ID N0:7l: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Imcar 

( i I ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID N0:71: 

TAGCAGCGGT GT 

( 2 ) INFORMATION FOR SEQ ID N0:72: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

TGGGOTTAGC AG . 



( 2 ) INFORMAnON FOR SEQ ID NO:73: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: Ihiear 

( i i ) MOLECULE TVTE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

GGTATCCCGT TA 



( 2 ) INFORMAHON FOR SEQ ID NO; 74: 
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( i ) SEQUENCE CHARACTERIffnCS: 
( A ) LENC3TH: 12 base pairs 
( B ) TYPE: nucleic acid 
. ( C ) STRANDEDNESS: single 
. ( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe)' 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

GTTCGGOGTA TO 

( 2 ) INFORMAnON FOR SEO ID N0:7S: 

( i ) SEQUENCE CHARACTERISTICS: 
< A)l£NGTH:l2 baie pairs 
( B ) TYPE: nucleic acid 
( C ) SFRANDEDNESS: single 
( D)T0POLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:7S: 

GCTGGTGTTA GG 

( 2 ) INFORMAnON FOR SEO CD NO:76: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)L£NG7H: L2bue pairs 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

GGTTAGGCTG GT 

( 2 ) INFORMAnON FOR SEQ ID NO:77: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) UENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( i i ) MOLECULE TVTE: DNA (probe) 

C X i ) SEQUENCE DESCRIFnON: SEQ ID NO:77: 

AAATCTGGTT AGG 

( 2 ) INFORMAnON FOR SEQ ID NO:78: 

( i ) SEQUENCE CHARACTERISTICS: 
( A )LENCrH: Ubascpairs 
( B ) TYPE: nucleic acid 
( C ) SmiANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE rVPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:78; 

AAATTTGAAA TCT 



( 2 ) INFORMAnON FOR SEQ ID NO:79: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) SniANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TVTEiDNA (probe) 
( 3C i ) SEQUENCE DESCRIPTION: SEQ [D NO:79: 
AAGATAAAAT TTG 

( 2 ) INFORMAITON FOR SEQ ID NO:80: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) ffraANDEDNESS: smglc 
( D )TOPOLOGY; linear 

( i i ) MOLECULE TYPE: DNACptobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:80: 

GCCAAAAAGA TA 

( 2 ) INFORMAnON FOR SEQ ID N0:81: 

( i ) SEQUENCE CHARAOnERISnCS: 
( A ) LENGTO: 11 base palis 
( B ) TYPE: nucleic Bcld 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFHON: SEQ ID NO:81: 

CGCCAAAAAG A . 



( 2 ) INFORMAnON FOR SEQ ID NO:82: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TSfPE:DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

CATACCGCCA A 



( 2 ) INFORMAnON FOR SEQ ID NO:83: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) lENCTH: 13 base pairs 
( B )TYPE:ttDcleic acid 
( C ) STRANDEDNESS: smgle 
• . (D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:83: 



( 2 ) INFORMAnON FOR SEQ ID NO:84: 



( I ) SEQUENCE CHARACIERISnCS: 
( A)LENGTH: Ubase pairs 
. ( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
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TGTTAAAAGT GCA 

( 2 ) INFORMAnON FOR SEQ ID NO;85: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 13 base pain 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE ryPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:85: 

GGGTGACTGT TAA 

( 2 ) [NFORMAnON FOR SEQ ID NO:86: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
{ C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO;86: 

GGGGOTGACT GT 

( 2 ) INFORMAHON FOR SEQ ID NO:87: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( J i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:87: 

AGTTGCGGGG T 

( 2 ) INFORMAnON FOR SEQ ID NO:88: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENOTH:U base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sbgle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:88: 

TGTGTTAGTT - GGG 



{ 2 ) INFORMAnON FOR SEQ ID NO:89: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) UENCTTH: 13 base pairs 
( B )rSfPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Ibear 

( i i ) MOLECULE TYPE: DNA (iprobe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:89: 

AAAATAATGT GTT 



( 2 ) INFORMAnON FOR SEQ ID NOiPO: 
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( I ) SEQUENCE CHARACTER[SnCS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOCY: linear .. 

( i i ) MOLECULE TYPE: DNA (probe) 

( X t ) SEQUENCE DESCRIPTION: SEQ ID N0:90: 

AGGGGAAAAT AA 



( 2 ) INFORMAnON FOR SEQ ID N0:9l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TVPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:91: 

GGAGGGGAAA AT 



( 2 ) INFORMAnON FOR SEQ ID NO:92: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:92: 

GGAAATTTTT TG 



( 2 ) INFORMAnON FOR SEQ ID NO:93: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

GGTGGAAATT TT 



( 2 ) INFORMAnON FOR SEQ ID NO:94: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

GGTTTGGTGG A 



( 2 ) INFORMAnON FOR SEQ ID NO:95: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID N0:9S: 
GAGGGGOGGT T 

( 2 ) INFORMAnON FOR SEQ ID NO:96: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pain 
( B ) TYPE: nucleic odd 
( C ) HANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIITION: SEQ ID NO:96: 

GCGGGGGAGG 

( 2 ) INFORMAnON FOR SEQ CD NO:97: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE; nucleic add 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i )MOl£CUIJE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:97: 

CAGAAGCGGG G 

( 2 ) INFORMAnON FOR SEQ ID NO:98: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (pobc) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:98: 

GTAOGCCAGA AG 

{ 2 ) INFORMAnON FOR SEQ ID NO:99: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE:nudcic(>cid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY; linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:99: 

GTGCTGTAGG CC 



( 2 ) INFORMAnON FOR SEQ ID NO:100: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:100: 
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TGTTTAAGTG CTG 1^ 

( 2 ) [NFORMAnON FOR SEQ 0) NO:101: 

( i ) SEQUENCE CHARACreRISnCS: 
( A ) LENCTH; 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA(pobc) 

( x 1 ) SEQUENCE DESCRIPTION: SEO 0) NO:101: 

TGTCTTTAAG TGC 13 

( 2 ) INFORMAnON FOR SEQ ID NO:102: 

< I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( i i )M01JECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ © N0:102: 

GCAGAGATGT GTT 13 

( 2 ) INFORMATION FOR SEQ ID NO:103; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )T0P0LOGY: linear 

C i 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:103: 

TTTGGCAGAG AT 12 

( 2 ) INFORMATION FOR SEQ ID NO:104: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)T0P0LOGY: linear 

( i 1 ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:1W: 

GGGGTTTGGC A 11 

( 2 ) INFORMAnON FOR SEQ ID NO:105: 

( I ) SEQUENCE CHARACTERISmCS: 
( A)LENCTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sfaglc 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:105: 

TGTTTTTGGG CT 12 



( 2 ) INFORMAnON FOR SEQ ID NO:106: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE; nucleic acid 
( C ) HANDEDNESS: sragle 
( D )TOP0LOGY: lincai 

( i i ) MOLECULE TVTE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTTON: SEQ ID NO:106: 

TTTGTTTTTG GG 

( 2 ) INFORMAnON FOR SEQ ID NO:107: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 12 base pairs 
( B ) TYPE: nacieic acid 
( C ) ynUNDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:lCr7: 

GGGTTCTTTG TT 

{ 2 ) INFORMAnON FOR SEQ ID NO:108: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nnclcic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (piobc) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:108: 

GTGTTAGGGT TCT 

( 2 ) INFORMAnON FOR SEQ ID NO:109: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base paira 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Ibear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:10P: 

TTTAGTAAGT ATGT 

( 2 ) INFORMAnON FOR SEQ ID NO:U0: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 13 base pairs 
( B )TYPE: nnclcic acid . 
( C ) STRANDEDNESS: smgle 
( D)TOP0LOGY linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

AACACACTTT AGT 



( 2 ) INFORMAnON FOR SEQ ID N0:111: 

( i ) SEQUENCE CHARACTBRISnCS: 
( A ) LENGTH: 14 base pairs 
( B ) TYPE: nndcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
, ( X i ) SEQUENCE DESCRimON: SEQ U> N0:1U: 
AATTAATTAA CACA 



( 2 ) [NFORMAnON FOR SEQ ID NO: 112: 

( i ) SEQUENCE CHARACreRISnCS: 
( A ) LENGTH: 13 base pain 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:U2: 

AAGCATTAAT TAA 



( 2 ) INFORMAnON FOR SEQ ID N0:113: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE: nnclcic acid 
( C ) ffTRAXDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:U3: 

CTCCTACAAG CAT . 

( 2 ) INFORMAnON FOR SEQ ID N0:1U: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
C B ) TYPE: nucleic odd . 
( C ) STRANDEDNESS: sragle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:ll4: 

TGTCCTACAA GCA 



( 2 ) INFORMAnON FOR SEQ ID N0:11S: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic 6cid 
( C ) SFRANDEDNESS: smgle 
( D )T0P0LpGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ tt> N0:115: 

ATTATTATGT CCT 



( 2 ) [NFORMAnON FOR SEQ ID NO: 116: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( X I ) SEQUENCE DESCRIPTION: SEQ © N0:U6: 
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TTGTTATTAT TATG 



( 2 ) INFORMAnON FOR SEQ ID N0:U7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIPTTON: SEQ ID N0:U7: 

ATTCAAATTG TTA 



( 2 ) INFORMAnON FOR SEQ ID NO: 118: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) SniANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

C 31 1 ) SEQUENCE DESCRIPTION: SEQ ID N0:U8: 

GCAGACATTC AAA 



( 2 ) INFORMATION FOR SEQ ID N0:U9; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

C X 1 ) SEQUENCE DESCRIPTION: SEQ ID N0:119: 

GCTOTGCAGA CA 



( 2 ) INFORMATION FOR SEQ ID NO:120: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: Qocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFTION: SEQ ID NO:120: 

AAAGTGGCTG TG 



( 2 ) INFORMAnON FOR SEQ ID N0:121: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:121: 



( 2 ) INFORMAnON FOR SEQ ID N0:122: 
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( I ) SEQUENCE CHARACIERrsnCS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nnclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Ibear 

( i i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

CATGTCTGTG TGG 

( 2 ) ENFORMATION FOR SEQ D NO:123: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:123: 

ATGATGTCTG TGT 

( 2 ) INFORMAnON FOR SEQ ID NO:124: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO;124: 

TTTTGTTATG ATG 

( 2 ) (NFORMAnON FOR SEQ P N0:125: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:l25: 

TTTTTTGTTA TGA 

( 2 ) INFORMAnON FOR SEQ ID NO: 126: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid ■ 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

ATAGGGTGCT CC 



( 2 ) INFORMAnON FOR SEQ ID NO:127: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:127: 
GCOACATAGG GT 



( 2 ) [NFORMATION FOR SEQ ID N0:12& 

( J ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
{ B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:128: 

TACTGCGACA TAG 



( 2 ) INFORMAnON FOR SEQ ID N0:129: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 13 base pain 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TVPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:l29: 

GACAGATACT GCG 

( 2 ) INFORMAnON FOR SEQ ID NO; 130: 

( J ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B j-mE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:130: 

AATCAAAGAC AGA 



( 2 ) INFORMAnON FOR SEQ ID N0:131: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGrH:13base pairs 
( B ) TYPE: nucleic acid 
( C ) mANDEDNESS: single 
( D)T0P0L0GY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:131: 

AGOAATCAAA GAC 



( 2 ) INFORMAnON FOR SEQ ID NO:132: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) smiANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:132: 
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TGAGOCAGGA AT 

( 2 ) [NFORMAnON FOR SEQ ID NO:133: 

( i ) SEQUENCE CHARACTERISmCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STftANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEO ID N0:n3: 

AGGATGAGGC AG 

( 2 ) INFORMAnON FOR SEQ ID NO: 134: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 13 base palra 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)T0P0LOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:134: 

AAATAATAGG ATG 

( 2 ) INFORMAnON FOR SEQ ID NO; 135: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic add 
( C ) STRA>TEDNESS: single 
( D)T0POLOGY: linear 

( i I ) MOLECULE TYPE; DNA (probe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID NO;135: 

GCGATAAATA AT 

( 2 ) INFORMATION FOR SEQ ID NO:136: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)T0POLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:136: 

TAGGATGCGA TA 



( 2 ) INFORMAnON FOR SEQ ID NO: 137: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENCrTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) SrRA>rDEDNESS: single 
( D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:137: 

GTAGGATGCG AT 



( 2 ) INFORMAnON FOR SEQ ID NO:138: 
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( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: ainglc 
( D )TOPOLOGV: linear 

( i i )M0LECUIE TYPE; DXA (probe) . 

C X I ) SEQUENCE DESCRIPTION: SEQ ID NO:l38: 

TTGAACGTAO GA 



( 2 ) INFORMAHON FOR SEQ ID NO:139; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE; Quclcic acid 
( C ) yrRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:U9: 

AATATTGAAC GTA 



( 2 ) INFORMAnON FOR SEQ ID NO:140: 

( i ) SEQUENCE CHARACrERISTICS: 
( A ) LENCTTH: 13 base pain 
( B ) TYPE: nncleic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( ii )MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:140: , 

GCCTGTAATA TTG 



( 2 ) INFORMAnON FOR SEQ ID NO:Ul: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nncleic acid 
( C ) STRAKDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TVPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:14l: 

TGTTCGCCTG TA 



( 2 ) INFORMAnON FOR SEQ ID NO:142: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRAKDEDNESS: single 
( D ) TOPOLOGY: linear 

( M ) MOLECULE TYPE- DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:142: 

OTATGTTCGC CT 



( 2 ) INFORMAnON FOR SEQ ID N0:143: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCWPTTON: SEQ ID NO; 143: 
CTCCCOTCAG TG 



( 2 ) OIFORMATION FOR SEQ ID N0:144: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRA>T>EDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIFHON: SEQ ID N0:144: 

GAGAGCTCCC GT 



( 2 ) INFORMAnON FOR SEQ ID NO:145: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
(D ) TOPOLOGY: linear 

( i i )MOLECUl£ TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:14S: 

ATGCAGAGCT CC 

( 2 ) INFORMAnON FOR SEQ ID NO; 146: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:t46: 

AATGCATGGA GA 



( 2 ) INFORMAnON FOR SEQ ID NO:147: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (pobe) 

( X i •) SEQUENCE DESCRIPTION: SEQ ID N0:147; 

ATACCAAATG CA 



( 2 ) INFORMAnON FOR SEQ ID NO: 148: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) UENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS; single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:148: 
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GACGAAAATA CCA 

( 2 ) INFORMAnON FOR SEQ ID NO:149: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: audcic acid 
( C ) yrRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID NO:149: 

CCCAGACGAA A 

( 2 ) INFORMAnON FOR SEQ ID NO: 150. 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D) TOPOLOGY: linear 

C i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:150: 

TACCCCCCAG A 

( 2 ) INFORMAOON FOR SEQ ID NO: 151: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:151: 

TGCATACCCCC 

( 2 ) INFORMAnON FOR SEQ ID N0:1S2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base paii5 
( B ) TYPE: nucleic acid 
< C ) STRANDEDNESS: sfagle 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (pobe) 

( X i ) SEQUENCE DESCRIPTTON: SEQ ID N0:lS2: 

TCGCGTGCAT AC . 



( 2 ) INFORMAnON FOR SEQ ID NO:153: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH:t2basepaiis 
( B ) TYPE- nnclelc acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:153: 

GACTATCGCG TO 



( 2 ) INFORMAnON FOR SEQ ID NO:154; 
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C i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) ffTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:154: 

ATGACTATCG CO 



( 2 ) INFORMAnON FOR SEQ ID N0:155: 

( t ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE- DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:l55: 

CTCGCAATGA CT 



( 2 ) INFORMAnON FOR SEQ ID N0:156: 

C i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCTH: 12 base pairs 
( B ) TVPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNAO»obe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:156: 

CGTCTCGCAA TG 



( 2 ) INFORMAnON FOR SEQ ID NO:157: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:157: 

CTCCAGCGTC TC 



( 2 ) INFORMAnON FOR SEQ ID N0:158: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE nucleic acid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (ptobt) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:158: 

TCCGGCTCCA G 



( 2 ) INFORMAnON FOR SEQ ID N0:lS9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: II base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE Tfm DXA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:159: 
GTGCTCCGGCT 



( 2 ) INFORMAnON FOR SEQ ID NO:160: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:160: 

GACCCTGAAG TAG 



( 2 ) INFORMAnON FOR SEQ ID N0:16l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nticlcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ [D N0:16l: 

TTTATGACCC TGA 



( 2 ) INFORMAnON FOR SEQ ID NO:162: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE- nucleic acid 
( C ) STRA>fDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:162: 

TTTAGGCTTT ATG 



( 2 ) INFORMAnON FOR SEQ ID NO:163: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:163: 

GCTATTTAGG CT 



( 2 ) INFORMAnON FOR SEQ ID NO:164: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) Tf?E: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE rVPE: DNA (probe) 



( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:1M: 
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TGGGCTATTT AG 



( 2 ) INFORMAnON FOR SEQ ID N0:16S: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TVPE; nucleic acid 
( C ) STRANDEDNESS: smglc 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:165: 

ACGTGTGGGC TA 



( 2 ) INFORMAnON FOR SEQ ID NO: 166: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B JtYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:166: 

AGGGGAACGT GT 



( 2 ) INFORMATION FOR SEQ ID NO:167: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:I67: 

TTTAAGGGGA AC 



( 2 ) INFORMAnON FOR SEQ ID NO:168: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base pairs 
( B )TVPE: nucleic acid 
( C ) STRANDEDNESS: stogie 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:168: 

ATGTCTTATT TAAG 



( 2 ) INFORMAnON FOR SEQ ID NO:169: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
{ B )TyPE:nacleicacid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:169: 

CATCGTGATG TCT 



( 2 ) INFORMAnON FOR SEQ D N0:l7a 
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( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )T0POLOGY: linear 

( i i ) MOLECULE TVPE; DNA (probe) ' 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:170: 

TCCATCGTGA TG 



( 2 ) INFORMAnON FOR SEQ ID N0:171: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
{ B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:171: 

GATGATCCAT CG 



( 2 ) INFORMAnON FOR SEQ ID NO:172: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID n6:172: 

AGACCTGATG ATC 



( 2 ) INFORMAnON FOR SEQ ID NO:173: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ D NO:173: 

GGGTGATAGA CCT 



( 2 ) INFORMAnON FOR SEQ ID NO:174: 

( i )SEQUENCECHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANT>EDNESS: shgle 
( D ) TOPOLOGY linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFHON: SEQ ID NO:174: 

ATAGGGTGAT AG A 



( 2 ) INFORMAnON FOR SEQ ID N0:175: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) SmiANDEDNESS: single 
( D ) TOPOLOGY: linear 
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C i i ) MOLECULE TYPE- DNA (probe) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:175: 
TCGTTAATAG GG 



( 2 ) INFORMAnON FOR SEQ ID NO:176: 

( i ) SEQUENCE CHARAOnERlSnCS: 
( A ) LENGTH: U base pain 
( B )TYPE:iincl«ic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:176; 

GTGAGTGGTT AAT 



( 2 ) INFORMAnON FOR SEQ ID NO:l77: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRAKDEDNESS: single 
( D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:177: 

TGTGCGGGAT AT 



( 2 ) INFORMAnON FOR SEQ ID NO:178: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:178: 

ACTCTTGTGC GG 



( 2 ) INFORMAnON FOR SEQ ID NO:179: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH: 13 base pairs 
( B ) TYPE: nnclcic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY: linear 

( j i )MOLECULETYPE; DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID N0:179: 

TAGCACTCTT GTG 



( 2 ) INFORMAnON FOR SEQ ID NO:180: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:180: 
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GGAGAGTAGC ACT 



( 2 ) INFORMAnON FOR SEQ ID NO: 181: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucUic acid 
( C ) STRANDEDNESS: sbgle 
( D)TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE; DNA (probe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID N0:181: 

GCGAGGAGAG TA 



( 2 ) INFORMAnON FOR SEQ ID NO: 182: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) SniAKDEDNESS: single 
( D)TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:182: 

CGGAGCGAGG A 



( 2 ) ENFORMATTON FOR SEQ ID NO:183: 

( i ) SEQUENCE CHARACTERISTICS: 
( A j LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sbgle 
( D ) TOPOLOGY: linear 



( 1 1 ) MOLECULE TYPE: DNA (probe) 



( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:183: 
GGCCCGGAGC 



( 2 ) INFORMATION FOR SEQ ID NO:184: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
{ D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:18+: 



( 2 ) INFORMAnON FOR SEQ ID NO: LBS: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ D NO:185: 



( 2 ) INFORMAnON FOR SEQ ID NO:186: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TVPE: nucleic acid 
( C ) SmiANTIEDNESS: single 
( D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTTON: SEQ ID N0:186: 

TACCCCCAAO TG 



( 2 ) INFORMAnON FOR SEQ ID NO: 187: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base paira 
( B ) TYPE: Ducleic acid 
( C ) yTRANDEDNESS: single 
( D ) TOPOLOGY; linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:187: 

TTTAGCTACC CC 



( 2 ) INFORMAnON FOR SEQ ID NO: 188: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TVTE: DNA (probe) 

( )t I ) SEQUENCE DESCRIPTION: SEQ ID NOtl88: 

TTCACTTTAG CTA 



( 2 ) INFORMAnON FOR SEQ ID NO:189: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE nPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:189: 

TACAGTTCAC TTT 



( 2 ) INFORMAnON FOR SEQ ID NO:190: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D) TOPOLOGY: linear 

( i I ) MOLECULE TYPE- DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:190: 

TCGAGATACA CTT 



( 2 ) INFORMAnON FOR SEQ ID N0:I91: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base plrs 
( B ) TVPE: nndcic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY:lhcar 
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( i i ) MOLECULE TVTEDNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID N0:191: 
CAGATGTCGA GAT 

( 2 ) INFORMAnON FOR SEQ ID NO:192: 

( i ) SEQUENCE CHARACTERtSTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nodeic ocid 
( C ).CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:192: 

AGOAACCAGA TG 

( 2 ) INFORMAnON FOR SEQ ID NO:193: 

( 1 ) SEQUENCE CHARACTERISTICS: 
(A)LENGTH:Ubascpain 
( B infPE: nucleic acid 
( C ) STRA>a)EDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFHON: SEQ ID N0:193: 

GAAGTAGGAA CCA 

( 2 ) INFORMAnON FOR SEQ ID N0:194: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B )TYPE;nneleicocid 
( C ) SniANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TVTE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

GACTGTAATG TGC 

( 2 ) INFORMAnON FOR SEQ ID NO:195: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TVTE: nodeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:19S: 

GGGATTTGAC TGT 



( 2 ) INFORMAnON FOR SEQ ID NO:196: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENCTH:t2basepair» 
( B ) TYPE* niuleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:196: 
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AGGCATTTGA CT 

( 2 ) INFORMAnON FOR SEQ [D NO: 197: 

( i ) SEQUENCE CHARACTERISTICS: 
( A.) LENGTH: 12 base pairs 
( B ) TVPE: nocleic acid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: linear 

( 1 I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ D) N0:197: 

ACGAGAAGGG AT 

( 2 ) INFORMAnON FOR SEQ ID N0:198: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE nucleic add 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY:lmcai 

(11) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:198; 

TGGGGACGAG AA 

( 2 ) INFORMATION FOR SEQ ID NO:199: 

( i ) SEQUENCE CHARACIERimCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:199: 

ATCCATGGGG AC 

( 2 ) INFORMATION FOR SEQ ID NO:200: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANTtEDNESS: single 
( D ) TOPOLOGY; linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:200: 

GGTCATCCAT GG 



( 2 ) INFOR\(AnON FOR SEQ ID NO:201: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE; nucleic Bcid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Ibear 

( i I ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:20l: 

AGGGCGGTCA T 



( 2 ) INFORMAnON FOR SEQ ID NO:202: 
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( I ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH: 12 base pairs 
( B ) TYPE: BDcleic acid 
( C ) SmiANDEDNESS: atnglc . 
.( D ) TOPOLOGY: linear . 

( i i ) MOLECULE TYPE: DNA (probe) ' 

( ji i ) SEQUENCE DESCRIPTION: SEQ ED NO:202: 

TATCTGAGGG GO 

( 2 ) INFORMAnON FOR SEQ ID NO;203: 

C i ) SEQUENCE CHARACTERISmCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: anclcic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIPTION; SEQ ID NO:203: 

ACCCCTATCT GA 



( 2 ) INFORMAnON FOR SEQ ID NO:204: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTR- 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:204: 

AGGGACCCCT A 



( 2 ) INFORMAnON FOR SEQ ID NO:205: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Imcar 

( i j ) MOLECULE TYPE: DNA (probe) 

( K i ) SEQUENCE DESCRIPTION: SEQ ID NO:205: 

TGGTCAAGGG AC 



( 2 ) INFORMAnON FOR SEQ ID NO:206: 

( i ) SEQUENCE CHARACTERISTICS: 
, ( A ) LENCTTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY- linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPHON: SEQ ID NO:206: 



( 2 ) INFORMAnON FOR SEQ D NO:207: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE; DKA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:207: 
AGGATGGTOG TC 



( 2 ) [NFORMAnON FOR SEQ D) NO:208: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH: 12 base pain 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i )MOLECUUE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:208: 

ACACGGAGGA TG 



( 2 ) INFORMAnON FOR SEQ ID NO;209: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:209; 

TGATTTACAC GG 

( 2 ) INFORMAnON FOR SEQ ID NO:210: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:210: 

GGGATATTGA TTT 



( 2 ) INFORMAnON FOR SEQ ID N0:211: 

( J ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D )T0P0L0GY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:211: 

GTGGCATTTG GA 



( 2 ) INFORMAnON FOR SEQ ID NO:212; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENC3TH: II base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS; single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:212: 
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ACGCGTGGCA T 

( 2 ) INFORMAnON FOR SEO ID N0:2U: 

( I ) SEQUENCE CHARACTERISnCS: . 
( A ) LENGTH: 11 base pain 
( B ) TYPE: nnclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEO ID N0:213: 

GGTGAGCGGT G 

( 2 ) [NFORMAnON FOR SEQ ID N0:2U: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:214: 

AGTGGOTGAG GG 

( 2 ) INFORMAnON FOR SEQ ID NO:215: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B )TirPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Imear 

(11) MOLECULE TVTE: DNA (pobc) 

( X i ) SEQUENCE DESCRIPTION: SEQ ED NO:215: 

GTATCCTAGT GOG 

( 2 ) INFORMAnON FOR SEQ ID N0:216: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE- nucleic acid 
( C ) ffniANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYPE DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:216: 

TTTGTTGGTA TCC 



( 2 ) INFORMAnON FOR SEQ ID NO:217: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( i I ) MOLECULE TiTPE DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:217: 

GTAGCTTTGT TGG 



( 2 ) INFORMAnON FOR SEQ ID NO:218: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base psirs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY; Imcai . 

( i i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID NO:218: 

TGGCTAGGTT TO 



( 2 ) INFORMAnON FOR SEQ ID N0:219: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:219: 

TAAGGGTGGG TA 



( 2 ) INFORMAnON FOR SEQ ID NO:22ft 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) yrRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:220: 

GTACTGTTAA GGG 



( 2 ) INFORMAnON FOR SEQ ID NO:221: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:221: 

TGTACTATGT ACTG 



( 2 ) INFORMAnON FOR SEQ ID NO:222: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYTE: nucleic acid 
( C ) STRANDEDNESS: sbglc 
( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:222: 

GGCTTTATGT ACT 



( 2 ) INFORMAnON FOR SEQ ID NO:223: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
{ B) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( I i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:223: 
AAATGOCTTT AT 

( 2 ) INFORMATION FOR SEQ ID NO:224: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TVTE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Imcar 

( i i ) MOLECULE TYPE: DNA (pobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:224: 

GGTAAATOGC TT 

{ 2 ) INFORMAnON FOR SEQ ID NO:225: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
C B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:225: 

TGTACGGTAA ATG 

( 2 ) INFORMAnON FOR SEQ ID NO:226: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:226: 

GTGCTAATGT ACG 



( 2 ) INFORMAnON FOR SEQ ID NO:227: 

( i ) SEQUENCE CKARACTERISnCS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nocteic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i )MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPHON: SEQ ID NO:227: 

TAATGTGCTA ATG 



( 2 ) INFORMAnON FOR SEQ ID NO:228: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 11 base poin 
( B ) TYPE: nndeic ftcid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 



( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:228: 
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CATGGGGAGG G 

( 2 ) D^fFORMAnON FOR SEQ ID NO;229: 

' . ( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs ' 
( B ) TVPE: Euclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:229: 

TGTAAGCATG GG 

C 2 ) INFORMAHON FOR SEQ ID NO:230: 

( i ) SEQUENCE CHARACTERISTICS: 
(A)LENGTH:U base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Iraear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:230: 

TTGCTTGTAA GCA 

( 2 ) INFORMATION FOR SEQ ID NO:231: 

( i ) SEQUENCE CHARACTERISTICS: 
(A)LENGTH:13basepaire 
( B ) TYPE; nucleic acid 
( C ) STOANDEDNESS: smglc 
( D ) TOPOLOGY: linear 

( ii ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:231: 

TGTACTTCCT TGT 

( 2 ) INFORMAnON FOR SEQ ID NO:232: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) SniANDEDNESS: single 
( D)T0P0LOOY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:232: 

TTGCTGTACT TGC 



( 2 ) [NFORMAnON FOR SEQ ID NO:233: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY; Ibeai 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:233: 

GGTTGATTCC TG 



( 2 ) INFORMAnON FOR SEQ ID NO:234: 
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( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRAhTDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i )MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:234: 

TTGAGCGTTG AT 



( 2 ) INFORMAnON FOR SEQ ID NO:235: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:235: 

GTGATAGTTG AGO 



( 2 ) INFORMAnON FOR SEQ ID NO:236: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:236: 

TTGATGTGTG ATA 



( 2 ) INFORMAnON FOR SEQ ID NO:237: 

( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE: niKleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Ibcar 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:237: 



( 2 ) INFORMAnON FOR SEQ ID NO:23& ; 

( i ) SEQUENCE CHARACTERISTICS: 
( A )LENGTH: 12bas«paiis 
( B )TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Ibear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:238: 



( 2 ) INFORMAnON FOR SEQ ID NO:239: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base paira 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE- DNA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ □) NO:239: 
ATTTGGAGTT GC 12 



( 2 ) INFORMAnON FOR SEQ ID NO:240: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( )t i ) SEQUENCE DESCRIPTION: SEQ ID NO:240: 

TACCGTACAA TAT 



( 2 ) INFORMATION FOR SEQ ID NO:24l: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Imear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:24l: 

TGGTACCGTA CAA 13 



( 2 ) DVFORMAnON FOR SEQ ID NO:242- 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:242: 

TATTTATGGT ACC 



( 2 ) INFORMAnON FOR SEQ ID NO:243: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (pobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:243: 

GGTCAAGTAT TTA 



( 2 ) INFORMAnON FOR SEQ ID NO:244: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) SrrRA>fDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE- DNA (probe) 



( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:244; 



5,837,832 

127 128 

-continued 



TACAGGTGGT CAA 



( 2 ) INFORMAnON FOR SEQ ID NO:245: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCTH: 13 base pairs 
( B ) TYPE; Doclcic acid 
( C ) STTRAJTOEDNESS: single 
( D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:24S: 

ATGTACTACA GGT 



( 2 ) INFORMATION FOR SEQ ID NO:246: 

( i ) SEQUENCE CHARACreRISnCS: 
( A ) LENGTH: 13 base pain 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: sbgle 
( D ) TOPOLOGY: lineal 

C I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:246: 

GGTTTTTATG TAC 



{ 2 ) INFORMATION FOR SEQ ID NO:247: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:247: 

GGATTGGGTT TT 



( 2 ) INFORMATION FOR SEQ ID NO:248: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 ba»c pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:248: 

TGTAGGATTG GG 



( 2 ) INFORMATION FOR SEQ ID NO:249: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYTE: nucleic acid 
( C ) mANDEDNESS: sfaglc 
(D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (piobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:249: 



( 2 ) INFORMAnON FOR SEQ ID NO:2S0t 
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C I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE:nocleie acid 
. ( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: tincar 

( i i )MOLEaJl£TVPE:DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:250: 

GGGTTTTGAT GT 



( 2 ) INFORMAnON FOR SEQ ID N0:2S1: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgic 
( D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:251: 

GGACGGGGTT T 



( 2 ) INFORMAnON FOR SEQ ID NO:2S2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE- nnclcic acid 
( C ) OTRANDEDNESS: single 
( D)TOP0LOGY: linear 

( i i ) MOUECULE TYPE; DNA (piobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N6:252: 

GTCAATACTT GGG 



( 2 ) INFORMAnON FOR SEQ ID NO:253: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE- nucleic acid 
( C ) STRANDEDNESS: smgle 
( D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE- DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:253: 

GGGTGAGTCA ATA 



( 2 ) INFORMAnON FOR SEQ [D NO:254: 

( i ) SEQUENCE CHARACTERrsnCS: 
. ( A)LENGTH:12bascpair3 
( B ) TYPE nucleic acid 
( C ) STRANDEDNESS: sbglc 
( D)T0P0LOGY linear 

( i I ) MOLECULE TVPE DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:254: 

TGGGTGAGTC AA 



( 2 ) INFORMAnON FOR SEQ ID NO:255: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCTTH: 12 base pairs 
( B ) TYPE noclcic acid 
( C ) SnWNDEDNESS: single 
( D ) TOPOLOGY: linear 
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C i i )MOLECUl£ TYPE: DXA (probe) 
( X ) ) SEQUENCE DESCRIPTION: SEQ ID NO:255: 
TGTTGATGGG TG 12. 



( 2 ) INFORMAnON FOR SEQ ID NO:256: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 12 base pairs 
( B )TVPE-nTiclcicocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:256: 

CGGTTGTTGA TG 12 



( 2 ) INFORMAnON FOR SEQ ID NO:2S7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
{ B )TVPE:niideic acid 
( C ) STRANDEDNESS; single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:257: 

ACATAGCGGT TG 



( 2 ) INFORMAnON FOR SEQ ID NO:2S8: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: lineal 

( i i ) MOLECULE TYPE DNA (ptobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:2S8: 

GAAAATACAT AGC 



( 2 ) INFORMAnON FOR SEQ ID NO:259: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 13 base pairs 
( B ) TYPE nucleic ocid 
( C ) STRANDEDNESS: single 
( D )T0POLOGY linear 

( I i )MOLECULETYPE DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:259: 

AATGTACGAA AAT 



( 2 ) INFORMAnON FOR SEQ ID NO:260: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 13 base pairs 
( B ) TYPE nodeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:260: 
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CCAGTAATGT ACC 



C 2 ) INFORMAnON FOR SEQ D NO:261: 

( i ) SEQUENCE CHAaACTEHtSnCS: 
( A ) LENOTH: L2 base pain 
( B }'IYFE:doc1c!c add 
( C ) STRANDHDNESS: single 
( D ) TOPOLOGY: linear 

( M ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:26l: 

TGGCTGGCAG TA 



( 2 ) INFORMAHON FOR SEQ [D NO:262: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) SnriANDEDNESS: single 
( D ) TOPOLOGY: Ibcar 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:262: 

TCATGGTGGC TG 



( 2 ) INFORMATION FOR SEQ (D NO:263: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pain 
( B ) TYPE: nucleic acid 
( C ) STOANDEDNESS: single 
( D)TOPOLOCY:Ikcaj 

C I 1 ) MOLECULE TYPE: DNA (pobe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:263: 

ACAATATTCA TGG 



( 2 ) INFORMATION FOR SEQ ID NO:264: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B )TYPE:nn£leic add 
( C ) STRANDEDNESS: sfaglc 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:264: 

TAGAATCTTA OCT 



( 2 ) INFORMAnON FOR SEQ ID NO:265: 

( I ) SEQUENCE CHARACreRISncS: 
( A ) LENCFTH: 13 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
< D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:265: 

TTTAAATTAG AAT 



( 2 ) INFORMAnON FOR SEQ ID NO:266: 
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( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: Qocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

.( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:266: 

GAATAAGTTT AAA 



( 2 ) [NFORMAnON FOR SEQ ID NO:267: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DxVA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:267: 

GAACACAGAA TAA 



( 2 ) INFORMAnON FOR SEQ ID NO:268: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LE>'GrH: 13 base pairs 
( B ) TYPE: nnctcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:268: 

AAACAACAGA GAA 



( 2 ) INFORMAnON FOR SEQ ID NO:269: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:269: 

CCCATGAAAG AA 



( 2 ) DMFORMAnON FOR SEQ ID NO:270: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LEN'GTH12bascpaiTS 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: sbglc 
( D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:270: 

TTCCCCATGA AA 



( 2 ) INFORMAnON FOR SEQ ID NO:271: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: linear 
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C i i ) MOLECULE TYTE:DNA (probe) 
( X i -) SEQUENCE DESCRIPTrON: SEQ ID NO:271: 
ATCTGCTTCC CC 



( 2 ) INFORMAnON FOR SEQ ID N'0:r72: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:272: 

CAAATCTGCT TC 



( 2 ) INFORMAnON FOR SEQ ID NO:273: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: Dndeic acid 
( C ) STRANDEDNESS: single 
(D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:273: 

GGTACCCAAA TC 



( 2 ) INFORMAnON FOR SEQ ID NO:274: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE T)rPE:DXA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:274: 

GGTGGTACCC AA 



( 2 ) INFORMAnON FOR SEQ ID NO:275: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOP0LOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ D NO:275: 

TACTTCGGTG GT 



( 2 ) INFORMAnON FOR SEQ © NO:276: 

( J ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: oodeic acid 
( C ) JTRANDEDNESS: single 
( D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 



( X i ) SEQUENCE DESCRimON: SEQ ID NO:276: 
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TGGAAAAAGG TT 



. ( 2 ) I>fFORMAnON FOR SEQ ID NO:277: 

( i ) SEQUENCE CHARACTERISTICS: 
. ( A ) LEKGTH: 12 base pairs 
( B ) TYPE: noclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

C X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:277: 

GTCCTTGGAA AA 



( 2 ) INFORMAnON FOR SEQ ID NO:278; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base paire 
( B ) TYPE: nucleic acid 
( C ) SniANDEDNESS: smglc 
( D ) TOPOLOGY: linear 

( M ) MOLECULE TYPE: DNA (probe) 

( X j ) SEQUENCE DESCRIPTION: SEQ ID NO:278: 

ATTTGTCCTT GG 



( 2 ) INFOHMAnON FOR SEQ ID NO:279: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE- nucleic acid 
; ( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:279: 

CTCTGATTTG TCC 



( 2 ) INFORMATION FOR SEQ ID NO:280: 

( i ) SEQUENCE CHARACTCRISTICS: 
( A ) LENGTH; 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:280: 

TTTTTCTCTG ATT 



( 2 ) INFORMAnON FOR SEQ ID NO:28l: 

( i ) SEQUENCE CHARACreRISnCS: 
( A ) LENGTH; 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS; single 
( D)T0POLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:281: 

TAAAGACTTT TTC 



( 2 ) INFORMAnON FOR SEQ ID NO:281- 
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C I ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 13 base pairs 
( B ) TYPE: noclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:282: 

GTGOAGTTAA AGA 

( 2 ) INFORMAnON FOR SEQ tt) NO:283: 

( i ) SEQUENCE CHARACreRISnCS: 
( A ) LENGTH; 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DXA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:2S3: 

TGGTGGAGTT AAA 

( 2 ) INFORMATION FOR SEQ ID NO:284: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRAN'DEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:2W: 

TGCTAATGGT GG 

( 2 ) INFORMAnON FOR SEQ ID NO:28S: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:28S: 

TTGGGTGCTA AT 

( 2 ) INFORMAnON FOR SEQ ID NO:286: 

( i ) SEQUENCE CHARACTERISTICS: 
< A)LENGTH: I2bascpain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS; single 
( D ) TOPOLOGY: Imcai 

( J I ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:286: 

TAGCTTTGGG TG 



( 2 ) [NFORMAnON FOR SEQ ID NO:287: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B } TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Imcw 
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( i i ) MOLECULE TYPE: DNA (probe) 
. ( X i ) SEQUENCE DESCRIPTION: SEO ID NO:287: 
.TCTTAOCTTT GG 12 



( 2 ) [NFORMAnON FOR SEQ ID NO:288: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base paira 
( B ) TYPE- nnclcic acid 
( C ) STRANDEDNESS: single 
( D)T0POLOGY: lineal 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:288: 

CACTTGTGCC CTGACTTTCA AC 22 



( 2 ) INFORMAnON FOR SEQ ID NO:289: 

( i ) SEQUENCE CHARACreRISnCS: 
( A ) LENGTH: 49 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: linear 

( i ' i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID NO:289: 

ATGCAATTAA . CCCTCACTAA AGGGAGACAC TTGTGCCCTG ACTTTCAAC 49 



( 2 ) INFORAOmON FOR SEQ ID NO:290: . 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 25 base pairs 
( B ) TYPE: nacleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:290: 

GACCCTGGGC AACCAGCCCT GTCGT 25 



{ 2 ) DJFORMAnON FOR SEQ ID NO:29l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCTH: 47 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:291: 

TAATACCACT CACTATAGCG AGGACCCTGG GCAACCAOCC CTGTCGT 47 



( 2 ) INFORMAnON FOR SEQ ID NO:292: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 25 base pairs 
( B ) TYPE: nnclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 



( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:292: 
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GTAGAATTCT GTTGACTCAG ATTGG 

( 2 ) [NFORMAnON FOR SEQ ID NO:293: 

( i )SEQUENCECHARACreRIST[CS: 
' ( A ) LENGTH: 27 base pairs 
( B ) TYPE- nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DXA(pobc) 

C x i ) SEQUENCE DESCRIPTION: SEO ID NO:293: 

AAATCCATAC AATACTCCAG TATTTGC 

( 2 ) INFORMAnON FOR SEQ ID NO:294: 

( i ) SEQUENCE CHARACIERiynCS: 
( A ) LENGTH: 27 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE mE:DNA (probe) 

C X i ) SEQUENCE DESCRIPTrON: SEQ ID NO-J94: 

GATAAGCTTG GGCCTTATCT ATTCCAT 

( 2 ) INFORMAnON FOR SEQ ID NO:295: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 28 base pairs 
( B ) TYPE: nucleic acid . 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: Imear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:295: 

ACCCATCCAA ACGAATGGAG GTTCTTTC 



( 2 ) INFORMATION FOR SEQ ID NO:296: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nnclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:296: 

AGCCTAGCTG AA 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:297: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base paira 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (oligonucleotide) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:297: 

TCCGATCGAC TT 



( 2 ) INFORMAnON FOR SEQ ID NO:298: 
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( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (pobc) 

( X i ) SEQUENCE DESCRIPTION; SEQ ID NO:298: 

CGCAATTAAC CCTCACTAAA 00 



( 2 ) INFORMAnON FOR SEO ID NO:299: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:299: 

AATTAACCCT CACTAAAGGO AG 



( 2 ) INFORMAnON FOR SEQ ID NO:300: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:300: 

TAATACGACT CACTATAGGG AG 



( 2 ) INFORMAnON FOR SEQ ID NO:301: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:301: 

ATTTAGGTGA CACTATAGAA 



( 2 ) INFORMAnON FOR SEQ ID NO:302; 

(i )SEQUENCE CHARACTERISTICS; 

(A >LENGTH: 10 base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Imcar 

C i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:302: 

GATNATATTT 



( 2 ) INFORMAnON FOR SEQ ID NO:303: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) SntANDEDNESS; single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRIPTTON: SEQ ID.NO:303: 
AGANOATATT 



( 2 ) DJFORMAnON FOR SEQ ID NO:304: 

C i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: tinm 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:3(M: 

AAGNTGATAT 



( 2 ) INFORMAnON FOR SEQ ID NO:30S: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:305: 

AAANATGATA 



( 2 ) INFORMAnON FOR SEQ ID NO:306: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; tO base pairs 
( B ) TYPE; nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TVPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:306: 

CAANGATGAT 



( 2 ) INFORMAnON FOR SEQ ID NO:307: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS; single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:307: 

CCANAGATGA 



( 2 ) INFORMAnON FOR SEQ ID NO:308: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Ibcar 

( i i ) MOLECULE TYPE: DNA (pobc) 



( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:308: 
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ACCNAAGATG 

( 2 ) INFORMAnON FOR SEQ ID NO:309: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECUUE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFnON: SEO ID NO:309: 

CACNAAAGAT 

( 2 ) INFORMATION FOR SEQ ID N0;3ia 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pain 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C I i ) MOLECULE TYPE: DNA (probe) 

( jt i ) SEQUENCE DESCRIPTION: SEQ ID NO:310: 

AGAAACNACA 



( 2 ) INFORMATION FOR SEQ ID N0:3U: 

. ( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 16 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C j i ) MOLECULE TYPE: DNA (probe) 

( 31 i ) SEQUENCE DESCRIPTION: SEQ IDN0:3U: 

ATTTCATTCT GTATTG 



( 2 ) INFORMATION FOR SEQ ID NO:312: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 16 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Ibear 

( i ! ) MOLECULE TYPE: DNA (probe) 



( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:312: 
CCGACTCCAG TCGTTA 



( 2 ) INFORMAnON FOR SEQ ID NO:313: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( i i ) MOLECULE TVPE: DNA (pobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:313: 



( 2 ) INFORMAnON FOR SEQ ID NO:314: 



5,837,832 

153 

-continued 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: lineai 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:314: 

CCGACTACAG TCGTT 



{ 2 ) INFORMAnON FOR SEQ ID NO;315: 

C i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE; nnclcic acid 
( C ) yTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:31S: 

CCGACTCCAG TCGTT 



( 2 ) INFORMAnON FOR SEQ ID N0:316: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:316; 

CCGACTTCAG TCGTT 



( 2 ) INFORMAnON FOR SEQ ID NO:317: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:317: 

GTAATTTCTT TTATAGTAGA AACCACAAAG GATAC 



( 2 ) INFORMAnON FOR SEQ ID N0;31& 

( i )SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 35 base pairs 
( B )TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (ollgonucleolide) 

( X i ) SEQUENCE DESCRIPTION; SEQ ID N0:318: 

CATTAAAGAA AATATCATCT TTGGTGTTTC CTATG 



( 2 ) INFORMAnON FOR SEQ ID N0:319: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sbgle 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (oligonucleotide) 
( X i ) SEQUENCE DESCRIFTION: SEQ ID N0:319: 
CATTAAAGAA AATATCATTG GTGTTTCCTA TG 



( 2 ) INFORMAHON FOR SEQ ID NO:320: 

( I ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 18 base pairs 
( B ) TYPE: nncleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRlFnON: SEQ ID NO:320; 

CATTAAAGAA AATATCAT 



( 2 ) INFORMAnON FOR SEQ ID NO:32l: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B )TYPE:mttIeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:32l: 

TATTAAAOAA AATATCATCT TTGGTGTTTC CTATC 



( 2 ) INFORMAnON FOR SEQ ID NO:322: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( X i ) SEQUENCE DESCRIPTTON: SEQ ID NO:322: 

CCTTAAAGAA AATATCATCT TTGGTGTTTC CTAAA 35 



( 2 ) INFORMAnON FOR SEQ ID NO:323: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 35 base pairs 
( B )rWE:EueieJcftcid 
( C ) STRANDEDNESS: single 
( p )TOPOLOGY: linear . 

( i i )MOLECULE TYPE: DNA (oligonuckUde) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:323: 

CTTTAAAGAA AATAAAAAAA TTGGTGTTTC CTAAA 35 



( 2 ) INFORMAnON FOR SEQ ID NO:324: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pairs 
( B ) TVTE: nucleic acid 
( C ) STRAI^EDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:324: 
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GGAAGTCTCC CATTTTAATT 

{ 2 ) INFORMATION FOR SEQ ID NO:325: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pairs 
( B ) TVTE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY; linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:325: 

CCTTCAGAGG GTAAAATTAA 

( 2 ) INFORMARON FOR SEQ ID NO:326: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pairs 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:326: 

CCTTCAGAGK GTAAAATTAA 

( 2 ) INFORMATION FOR SEQ ID NO:327: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pairs 
( B )TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOP0L0GY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:327: 

CCTTCAGAGT GTAAAATTAA 

( 2 ) INFORMATION FOR SEQ ID NO:328: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY; linear 

( 1 I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:328: 

CCTTCAGAGG GTAAAATCA 



( 2 ) INFORMAnON FOR SEQ ID NO:329: 

( i ) SEQUENCE CHARACTERISTICS: 
C A ) LENGTH: 19 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( i i ) MOLECULE TVTE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:329: 

CCTTCAGAGG GTAAAATTA 



( 2 ) INFORMAnON FOR SEQ ID NO:330: 
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( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pain 
( B )rVTE: nucleic acid 
( C ) STRANDEDNESS: single 
, .( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA(pobc) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:330: 

GATTCAGAGT GTAAAATAC 

( 2 ) INFORMAnON FOR SEQ ID NO:33l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pairs 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Imear 

C I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:33i: 

AAAAAAGAGT GTAAAATGA 

( 2 ) INFORMAnON FOR SEQ ID NO:332: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i )MOLECULE TYPE: DNA (oligoaucleolidc) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:332: 

CATTAAAOAA AATAACATCA TTGGTGTTTC CTATG 



( 2 ) [NFORMAnON FOR SEQ ID NO:333: 

( i ) SEQUENCE CHARACTERiynCS: 
( A ) LENGTH: 648 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA CoUgonuclcoUde) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:333: 

AACAAACCTA CCCACCCTTA ACAGTACATA GTACATAAAG CCATTTACCG TACATAGCAC 60 

ATTACAGTCA AATCCCTTCT CGTCCCCATG GATGACCCCC CTCAGATAGG GGTCCCTTCA 120 

CCACCATCCT CCGTGAAATC AATATCCCCC ACAAGAGTGC TACTCTCCTC GCTCCGGGC.C 180 

CATAACACTT GGGGGTAOCT AAAGTGAACT OTATCCGACA TCTGGTTCCt ACTTCAGGGT 240 

CATAAAGCCT AAATAGCCCA CACGTTCCCC TTAAATAAGA CATCACGATG GATCACAGGT 300 

CTATCACCCT ATTAACCACT CACGGGAGCT CTCCATGCAT TTGGTATTTT CGTCTGGGGG 360 

GTATGCACGC GATAGCATTG CGAGACGCTG GACCCGGAGC ACCCTATGTC GCAGTATCTG 420 

TCTTTOATTC CTGCCTCATC CTATTATTTA TCGCACCTAC GTTCAATATT ACAGGCGAAC 480 

ATACTTACTA AAGTGTGTTA ATTAATTAAT GCTTGTAGGA CATAATAATA ACAATTGAAT 540 

GTCTGCACAG CCACTTTCCA CACAGACATC ATAACAAAAA ATTTCCACCA AACCCCCCCT 600 

CTCCCCCCCT TCTGGCCACA GCACTTAAAC ACATCTCTGC CAAACCCC 64B 



{ 2 ) INFORMAnON FOR SEQ ID NO:334: 
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( i ) SEQUENCE CHAItACTERISnCS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: lincai 

( i i ) MOLECULE TYPE; DNA(pobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:334: 

GATGCTGAGG AG 



( 2 ) INFORMAnON FOR SEQ ID NO;335: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TirPE:DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:335: 

CTCCTCCCCG GT 



( 2 ) [NFORMAnON FOR SEQ ID NO:336: 

( i ) SEQUENCE CHARACTERISTICS; 

( A)LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE Tn>E:DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:33<): 

ACTCCTCCCC GG 



( 2 ) INFORMAnON FOR SEQ ID NO:337: 

( i ) SEQUENCE CHARACTERlSnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) HANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ \D NO:337; 

GACTCCTCCC CG 



( 2 ) INFORMAnON FOR SEQ ID NO:338: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (pobc) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:338: 

CGACTCCTCC CC 



( 2 ) INFORMAnON FOR SEQ ID NO:339: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) HANDEDNESS: single 
( D )TOPOLOGY: linear 
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C i i ) MOLECULE TYPE: DNA (probe) 
C X i ) SEQUENCE DESCRimON: SEQ ID NO:339: 
ACGACTCCTC CC 



( 2 ) INFORMATION FOR SEQ ID NO:340: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
C B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i )MOLECULJE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFKON: SEQ ID NO:340: 

TACGACTCCT CC 



( 2 ) INFORMAnON FOR SEQ ID NO:341: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) SFRANDEDNESS: single 
( D) TOPOLOGY: linear . 

( i i ) MOLECULE rVPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:341: 

CTACGACTCC TC 



( 2 ) INFORMATION FOR SEQ ID.NO:342: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE; nncleic acid 
( C ) STRANDEDNESS: sbglc 
( D)T0P0LOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:342: 

TCTACGACTC CT 



( 2 ) INFORMAnON FOR SEQ ID NO:343: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)T0P0LOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:343: 

TTCTACGACT CC 



( 2 ) INFORMAnON FOR SEQ ID NO:344: 

( i ) SEQUENCE CHARACTERISTICS: 
( A)LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: liacai 

( i i ) MOLECULE TYPE; DNA (probe) 



( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:344: 
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ATTCTACGAC TC 

C 2 ) INFORMATION FOR SEQ ID N0:M5: 

( i )SEQUENCECHARACTERISTTCS: 
( A ) UENCTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
C D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

Cxi) SEQUENCE DESCRIPTION: SEQ U) NO:345: 

TATTCTACGA CT 

( 2 ) INFORMAnON FOR SEQ ID NO:346: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base paint 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TVPE: DNA (probe) 

( It i ) SEQUENCE DESCRIPTION: SEQ ID NO:346: 

CTATTCTACG AC 

( 2 ) INFORMATION FOR SEQ ID NO:347: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C 1 i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION; SEQ ID NO:347: 

CCTATTCTAC GA 

( 2 ) INFORMATION FOR SEQ ID NO:348: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: tO base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:348: 

TCCTCCCCGG 



( 2 ) INFORMAnON FOR SEQ ID NO:349: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ED NO:349: 

CTCCTCCCCG 



( 2 ) INFORMAnON FOR SEQ ID NO:3S0: 
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C 1 ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE; nucleic acid 
( C ) CTEIANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

(x ] ) SEQUENCE DESCRIPTION: SEQ ID NO:350: 

ACTCCTCCCC 

C 2 ) INFORMAHON FOR SEQ ID NO:351: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pain 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

C i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ D N0:3S1: 

GACTCCTCCC 



( 2 ) INFORMAnON FOR SEQ ID NO:352: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

. ( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:352: 

CGACTCCTCC 



( 2 ) INFORMAnON FOR SEQ ID NO:353: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STTRANDEDNESS: single 
( D ) TOPOLOGY: Imcar 

( i i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRIPTION: SEQ ID NO:353: 



( 2 ) INFORMAnON FOR SEQ ID NO:354: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sbgic 
( D ) TOPOLOGY; linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:354: 

TACGACTCCT 



( 2 ) INFORMAnON FOR SEQ ID NO:355: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE rVTE-DNA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:355: 
CTACGACTCC . . 

( 2 ) INFORMAnON FOR SEQ ID NO:356: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: tObasepaire 
( B ) TYPE: nucleic acid 
( C ) STRA>rDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

C X i ) SEQUENCE DESCRimON: SEQ ID NO;356: 

TCTACGACTC 

( 2 ) INFORMAnON FOR SEQ ID NO:357: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STTBANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:357: 

TTCTACGACT 

( 2 ) INFORMAnON FOR SEQ ID NO:3S8: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) UENCTH: 10 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:358; 

ATTCTACGAC 



{ 2 ) INFORMAnON FOR SEQ ID NO:359: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE nucleic acid 
< C ) STOANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:359: 

TATTCTACGA 



( 2 ) INFORMAnON FOR SEQ ID NO:360: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 184 base pairs 
( B ) TYPE: nucleic acid 
( C ) STOANDEDNESS: single 
( D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 



C X i ) SEQUENCE DESCRIPTION: SEQ ID NO;360: 
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TACTCCCCTG CCCTCAACAA GATGTTTTGC CAACTGGCCA AGACCTGCCC TGTGCAGCWG 60 
KGGGWWGATT CCACACCCCC GCCCGGCACC CGCGTCCGCG CCATGGCCAT CTACAAGCAG 120 
TCACAGCACA TGACGGAGGW WGKGAGGCGC TGCCCCCACC A T G A G C G C Y G , C Y C A G A T A G C 180 

18 4 
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claim- 3'-TAGTAGXAACCACAA (SEQ ID. N0:13); 

1. An array of oHgonucleoUde probes immobilized on a 3'-AGTAGAXACCACAAA (SEQ ID. No:14); 
solid support, said array having at least 100 probes and no 3'-GTAGAAXCCACAAAG (SEQ ID. NO: 15); 
more than 100.000 different oUgonucleotide probes 9 to 20 3*-TAGAAAXCACAAAGG (SEQ ID. NO: 16); and 
nucleotides in length occupying separate known sites in said 15 3'-AGAAACXACAAAGGA (SEQ ID. N0:17); wherein 
array, said oligonucleotide probes comprising at least four each set comprises 4 probes, and X is individually A, G, 
sets of probes: (1) a first set that is exactly complementary c, and T for each set. 

to a reference sequence and comprises probes that com- 9 ^^^y of claim 1, wherein said reference sequence 

pletely span the reference sequence and, relative to the ^ ^ sequence of a D-Ioop region of human mitochondrial 

reference sequence, overlap one another in sequence; and (2) 20 djsja, 

three additional sets of probes, each of which is identical to ^^^^ ^^^^ ^ wherein" said probes are 15 

said first set of probes but for at least one different um-ieotides in length, and said array comprises a first set of 

nucleotide, which different nucleotide is located m the same complementary to a sequence contained in a 

position in each of the three additional sets but which is a ^ ^^^^ ^^^^^^ ^^^80 to 356 of the refer- 

different nucleotide in each set 25 ^ additional sets of probes identical to 

2. The array of claim 1 fiirther compming fourth J 3,ibut for position 7, relative to a 3'-end of a probe, 
'd'f Holb^^^^^^^^^^^ S 'Sd is Jalently attached to the substrate, where, 
^tS^rrS^S^^^^^ foreachofthefouradditionalprobesetsadiffer^^^ 

is a double-stranded nucleic acid and probes complementary 30 ^^^^ ^ 1°^^^^^' ^^^^^ ^'^^ P'°^*^ '""ff. ^""^ "^^^ 

to both strands of said reference are in said array. there is an identical probe in one of the foi^^a^dmond se^ 

4. The array of claim 1, wherein said probes are 12 to 17 and such that the array has between 2500 and 100,000 
nucleotides in length. oligonucleotide probes. 

5. The array of claim 4, wherein said probes are 15 11. The array of claim 1, wherem said reference sequence 
nucleotides in length and attached by a covalent linkage to 35 is a sequence from an exon of a human p53 gene. 

a site on a 3'-endofsaid probes, and said differem nucleotide 12. The array of claim 11, wherein said reference 

is located at position 7, relative to the 3'-end of said probes. sequence comprises at least a 60 nucleotide contiguous 

6. The array of claim 1, wherein said reference sequence sequence from exon 6 of a p53 gene. 

is exon 10 of a CFTR gene, and said array has between 1000 13. The array of claim 11, wherem said reference 

and 100,000 oligonucleotide probes 10 to 18 nucleoUdes in 40 sequence is exon 5 of a p53 gene, said probes are 17 

jejjgjlj nucleotides long, and said array comprises a first set of 

7. The array of claim 6, wherein said array comprises a set probes exactly complementary to said sequence and at least 
of probes comprising a specific nucleotide sequence selected three additional sets of probes, each set comprising probes 
from the group of sequences consisting of: identical to said first set but for a nucleotide at position 7, 
S'-TTTATAXTAG (SEQ ID. NO:302); 45 relative to a 3'-end of a probe, which 3'-end is covalently 
S'-TTATAGXAGA (SEQ ID. NO:303); attached to the substrate, which nucleotide is different from 
3'-TArAGTXGAA (SEQ ID . NO:304); a nucleotide at this position in a corresponding probe of said 
3'-ArAGTAXAAA(SEQID.NO:305); first set. . . • u r 
3'.TAGTAGXAAC (SEQ ID. NO:306); 14. The array of claim 1, wherem said probes are oh- 
3*-AGTAGAXACC (SEQ ID. NO:307); 50 godeoxyribonucleotides, 

3'-GTAGAAXCCA (SEQ ID. NO:308); 15. The array of claim 1. wherein said array has between 

3'-TAGAAAXCAC (SEQ ID. NO:309); and 10,000 and 100,000 probes. 

3'.AGAAACXACA (SEQ ID. N0:310); wherein each set 16. The array of claim 1, wherein the reference sequence 
comprises 4 probes, and X is individually A, G, C, and T is from a human immunodeficiency vims, 

for each set. 55 17. The array of claim 16, wherein the reference sequence 

8. The array of claim 6, wherein said group of sequences is from a reverse transcriptase gene of the human immuno- 
consistsof: deficiency virus. 

3*-TTTArAXTAGAAACC (SEQ ID. N0:9); 18. The array of claim 1, wherem said probes are immo- 

3'-TTArAGXAGAAACCA (SEQ ID. NO:10); bilized to said soUd support via a linker. 



3'-TArAGTXGAAACCAC (SEQ ID. N0:11); 
3'.ArAGTAXAAACCACA(SEQ ID. N0:12); 



